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EXTINCTION FOLLOWING PARTIAL AND 
VARIED REINFORCEMENT 


FRANK A. LOGAN, EILEEN M. BEIER, AND WENDELL D. KINCAID 


Yale Unwersity’ 


A partial reinforcement training 
procedure typically results in greater 
resistance to extinction than con- 
tinuous reinforcement (6). A number 
of hypotheses have been advanced in 
an effort to account for this empirical 
generalization (e.g., 2, 5,9, 10, 13, 14). 
None of these appears completely 
satisfactory within an S-R reinforce- 
ment frame of reference, although 
some may provide a partial answer. 
It is our intent to provide new data 
which bear uniquely on this problem, 
and to offer one analysis which these 
data suggest. 

Partial reinforcement confounds the 
effects of two more basic operations: 
varied delay of reinforcement and 
varied magnitude of reinforcement.’ 
That is to say, magnitude of reinforce- 
ment is sometimes a finite quantity 
and sometimes is zero; likewise, delay 
of reinforcement is sometimes a finite 


1 This research was supported in part by a 
grant from the National Science Foundation. 

? Delay and magnitude may be uncorrelated 
so that four reinforcement experiences are 
available using two magnitudes and two delays. 
Partial reinforcement is a still more special case 
where delay and magnitude are correlated pro- 
viding only two reinforcement experiences. 
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varied. 


interval and sometimes is (effectively) 
infinite. It was belief that 


our a 
study of varied delay and varied 
magnitude separately would place 


partial reinforcement in a more com- 
plete systematic perspective. 

Accordingly, during training for one 
group of Ss the delay of reinforcement 
was varied randomly between two 
equally likely values while magnitude 
was held constant; for another group 
only magnitude of reinforcement wae 
For each of these experi- 
mental groups there were two control 
groups, one of which a 
constant reward at the preferred of 
the two values and the other at the 
mean of the two values for the experi- 
mental groups. The acquisition re- 
sults were presented elsewhere (8); 
briefly, the varied delay group at- 
tained a performance level comparable 
to that of its preferred delay control 
group whereas the varied magnitude 
group closely approximated its mean 
magnitude control group. This paper 
presents the relative rates of extinc- 
tion following these training 
ditions, together with other relevant 
data. 


rec €iV ed 


con- 
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Variep DeLay ann Macnitrupe 
Experiments | ann Variep 
Macnirupe Experiment I] 


Method 


Subjects.—Subjects were black-hooded rats 
obtained from the Sennett Antique and Pet Shop, 
Tiffin, Ohio. They were 90-120 days of age at 
the beginning of their training. 

A pparatus.—The double-alley maze has been 
described in detail elsewhere (8). For the 
present design, Ss ran down a single straight 
alley, from which were automatically recorded 
starting speed (reciprocal of time to 6 in. from 
start door) and running speed (4 times reciprocal 
of time over 4 ft. of straight alley). 

Training procedure.—Routine habituation 
and preliminary training procedures were em- 
ployed (8). The Ss lived in individual cages, 
were maintained on a 24-hr. hunger cycle, and 
were run one trial per day. 

Experimental design.—During acquisition Ss 
received the following conditions of reinforce- 
ment. (A) Varied Delay Exp. I (all Ss received 
a constant 4-pellet magnitude of reinforcement) : 
(1) 1-sec. delay (preferred delay group, N = 8); 
(2) 5-sec, delay (mean delay group, N = 8); (3) 
l-sec. delay on a random half of each block of 10 
trials, and a 9-sec. delay on the remaining half 
(varied delay group, N= 7). (B) Varied 
Magnitude Exp. I (all Ss received a constant 
4-sec. delay of reinforcement): (1) 9-pellet 
magnitude (preferred magnitude group, N = 8); 
(2) 5-pellet magnitude (mean magnitude group, 
N = 8); (3) 9-pellet magnitude on a random 
half of each block of 10 trials and a 1-peilet 
magnitude on the remaining half (varied magni- 
tude group, N = 8). (C) Varied Magnitude 
Exp. II (all Ss received a constant 4-sec. delay 
of reinforcement): (1) 9-pellet magnitude (pre- 
ferred magnitude group, N = 8); (2) 9-pellet 
magnitude on a random half of each block of 10 
trials and nothing on the remaining half (partial 
reinforcement group, N= 8); (3) 9-pellet 
magnitude on a random half of each block of 10 
trials and a 1-pellet magnitude on the remaining 
half (varied magnitude group, N = 8). 

The above conditions were run for 50 trials 
for Varied Delay Exp. I, and 60 trials for the 
two varied magnitude experiments. The Ss in 
Varied Delay Exp. I were then given 20 non- 
reinforced trials; Ss in Varied Magnitude Exp. I 
were given 30 nonreinforced trials; and Ss in 
Varied Magnitude Exp. II were given 45 non- 
reinforced trials. 

Extinction measure.—The most generally 
useful extinction measure would be one approxi- 
mating the parameter of a decay function which 
determines the fractional decrement per ex- 
tinction trial. When extinction curves begin at 


markedly different levels, the raw speed data do 
not adequately provide graphic representation 
or statistical test of possible differences in this 
measure. The data were therefore transformed 
to yield a direct indication of relative rate of 
extinction. 

For each S individually, estimates of his 
terminal acquisition and extinction speeds were 
obtained; terminal acquisition speed was as- 
signed a value of unity, terminal extinction 
speed was set equal to zero. Speeds during 
extinction were then expressed proportionately 
on this scale; e.g., for an S with a terminal 
acquisition speed of .6 and a terminal extinction 
speed of .2, a speed of .6 was rescored 1.0, one 
of .5 became .75, etc. Medians of these trans- 
formed scores were then taken and are presented 
in this paper. ‘These operations, it will be seen, 
force all curves to begin at unity and approach 
zero, with the score showing the average per- 
centage of loss yet to be demonstrated, and with 
the curves therefore suggesting the relative rate 
at which extinction is progressing. 


Results 


Performance during extinction of 
the varied delay group relative to its 
two control groups in Varied Delay 
Exp. I is shown in Fig. 1. No differ- 
ence in relative rate of extinction is 
apparent among these three groups. 
Results of the Varied Magnitude Exp. 
I are not shown graphically. The 
two control groups did not appear to 
differ in relative rate of extinction, 
but there was a statistically unreliable 
suggestion that the varied magnitude 
group exhibited a slower relative ex- 
tinction rate. The Varied Magnitude 
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Fic. 1. Relative running speeds during ex- 
tinction are shown on the !eft for Varied Delay 
Exp. I, and on the right for Varied Magnitude 
Exp. IL. 
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Exp. II was intended to re-evaluate 
the suggested reduced rate of extinc- 
tion following varied magnitude train- 
ing, and to provide a partial rein- 
forcement control group. Relative 
extinction curves of these three groups 
are shown in Fig. 1. It will be noted 
that the varied magnitude and partial 
reinforcement groups do not appear 
to differ in relative rate of extinction 
but both show clearly slower extinc- 
tion rates than the preferred magni- 
tude (continuous) control group.* 
Statistical evaluation of these re- 
sults was obtained by first finding for 
each S his median relative speed for 
the first five blocks of extinction 
trials. The 24 Ss in each experiment 
were ranked and the Kruskal and 
Wallis H test (3, p. 423) was used as 
a test of over-all differences among 
the three groups. This test failed to 


reach conventional levels of statistical 
significance for Varied Delay Exp. I 


and for Varied Magnitude Exp. I. 
For Varied Magnitude Exp. I], 
= 13.528 which, with 2 df, is signifi- 
cant at beyond the .O1 level. The 
White 7” (3, p. 417) test was then 
used to evaluate separately differences 
among the groups in Varied Magni- 
tude Exp. II. 
magnitude 


The partial and varied 
groups did not differ 
significantly ; between the partial and 
preferred magnitude groups, 7’ = 39 
(ny = nz = 8, P < .O1); and between 
the varied and preferred magnitude 
groups, T’ = 46 (ny = 
P < 05). If the varied and pre- 
ferred magnitude groups from the two 
varied magnitude 


n, = 8, 


experiments are 

* Since the graphs are presented in terms of 
relative extinction, readers interested in raw 
speed curves can approximate these from the 
following figures. Acquisition and extinction 
asymptotes were, respectively: preferred magni- 
tude, .620 and .181; varied magnitude, .580 and 
.220; partial reinforcement, .537 and .222; pre- 
ferred delay, .544 and .105; varied delay, .522 
and .109; mean delay, .301 and .107. 


treated together, 7” = 204 (n,; = ny 
= 16, P < .05). 

The data were also analyzed in 
terms of the number of extinction 
trials required to reach a criterion of 
three trials with relative 
speed at or below .50 (i.e., to where 
extinction had progressed half as far 
as it was going toforthat S). Identi- 
cal levels of significance were obtained. 


successive 


Discussion 


Without here evaluating other pub 
lished alternatives, we offer the following 
analysis concerning experimental extinc- 
tion. The extinction of an instrumental 
response is due, at least in part, to the 
loss of incentive morivation. Incentive 
motivation in Hull’s system (4) is 
labeled K, and following Spence (11, 12), 
K is assumed to be dependent upon the 
fractional anticipatory goal 
(r,). The extinction of an instrumental 
response 1s therefore related to the ex 
tinction of rz which is, in turn, a function 


response 


of the number of unreinforced evocations 
of r,. If the conditioned anticipatory 
r, on any trial is larger than would be 
appropriate for the primary reward 
actually given on that trial (i.e., a change 
from a larger to a smaller, or to no re 
ward), the surplus r, may occur several 
times without reinforced. Fol 
lowing this kind of analysis, we propose 
the hypothesis that the loss of an instru 
mental response tendency which results 
from a nonreinforced trial is directly 
related to the extent to which r, persists 
in occurring after the time of reinforce 
ment has passed. 

From this 
expect resistance to 


being 


assumption, one would 
extinction to be 
greatest for Ss who, immediately upon 
failure to receive an accustomed reward, 
stop performing the anticipatory goal 
response, or in subjective terminology, 
immediately stop anticipating reward. 
We shall call the cues which occur after 
the regular time of reinforcement on a 
nonreinforced trial the postreinforcement 
time cues. Therefore, to predict relative 
resistance to extinction, one must deter 
mine whether the 


training conditions 
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allowed S to extinguish r, to the post- 
reinforcement-time cues, i.e., in everyday 
terminology, to take “No!” for an 
answer.‘ 

Fortunately, employing this hypothe- 
sis within the Hullian framework permits 
a formal rather than a casual derivation 
of whether any specified reinforcement 
procedure provides this extinction ex- 
perience. Since r, is acquired and 
extinguished by the laws of classical 
conditioning, one has only to examine the 
training conditions from this point of 
view. 

Consider first continuous reinforce- 
ment: r, will become conditioned directly 
to the prereinforcement cues, but will 
also have a generalized tendency to 
occur to the postreinforcement-time cues. 
The latter do not actually occur during 
training because the reward itself, by 
stimulus interaction, modifies the on- 
going traces. (That is to say, the post- 
reinforcement cues on rewarded 
trials are different from the postrein- 
forcement-time cues on nonrewarded 
trials because they are changed by the 
occurrence of the goal stimulus.) When 
reward is omitted in extinction, however, 
the postreinforcement-time cues do oc- 
cur, do evoke r,, and thereby produce a 
large extinction effect. 

Turning to varied delay of reward, the 


*This hypothesis is equally applicable to 
classical conditioning. In classical conditioning, 
the postreinforcement-time cues are the stimulus 
trace ages older than the age at which the UCS 
was previously given. Inhibition is a function 
of the number of unreinforced responses, but 
unless the response has been extinguished to the 
post-UCS cues, several unreinforced responses 
will occur on any one extinction trial. It should 
be observed that, using partial reinforcement, 
ages up to the time of the UCS receive 50% 
reinforcement to maintain the response, but 
post-UCS ages receive 100% nonreinforcement 
(because they occur only on the trials when the 
UCS is omitted) and the response becomes ex- 
tinguished to them. As in an earlier analysis 
(7), there is set up a discrimination between the 
post-UCS ages and earlier ages. This hypothesis 
is not presumed, by itself, to account for all 
extinction results following various patterns of 
reinforcement. ‘The Hull-V. Sheffield analysis 
is a logical consequent of the Hullian system and 
complements the present analysis, 


postreinforcement-time cues again do 
not occur on short delay trials. These 
cues do occur on long delay trials, 
however, and we must attend to the 
stimulus asynchronism between these 
postreinforcement-time cues and the 
subsequent reward. If the longer delay 
in varied delay is very long, then the 
occurrence of r, to the postreinforcement- 
time cues will be nonreinforced because 
the subsequent reward is too late to act 
as an effective UCS. If, on the other 
hand, the longer delay is not very long, 
then the occurrence of r, to the postrein- 
forcement-time cues will be maintained 
by the later reward as an UCS. In 
general, the tendency for the postrein- 
forcement-time cues to evoke r, will vary 
inversely with the difference in delays 
used in varied delay training. Re- 
sistance to extinction will therefore be 
increased by varied delay training when 
the difference in delays is large, and will 
have progressively less effect as the 
difference becomes smaller.® 

The effect of varied magnitude training 
will also depend upon the difference in 
magnitude. Varied magnitude training 
is assumed to involve two r,’s (that 
appropriate to the large, and that 
appropriate to the small, reward) both 
of which generalize to the postreinforce- 
ment-time cues and produce a compro- 
mise fr, intermediate between them. 
Thus, on small reward trials, there will 
be a generalized tendency for the 
compromise r, to occur to the cues 
present after consuming the small re- 
ward, because these cues are similar to 
the prereinforcement cues. Since no 
subsequent large reward will occur to 
serve as an UCS, r, will be extinguished 
to a stimulus pattern whose similarity to 
the postreinforcement-time cues depends 
upon the size of the small reward 
(because the smaller the reward the 
smaller the stimulus interaction with the 
on-going traces). Accordingly, the ex- 


* Varied delay may actually reduce resistance 
to extinction if the difference in delays is small 
because r, then receives strong direct condi- 
tioning to the postreinforcement-time cues as 
well as the generalized tendency previously 
described, 
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tent to which the postreinforcement-time 
cues lose their tendency to evoke r, 
(hence increasing resistance to extinc- 
tion) will vary directly with the difference 
between the two magnitudes.® 

These effects of varied delay and 
varied magnitude converge upon partial 
reinforcement as a limiting case in which 
the difference between the delays and 
the magnitudes is at a maximum. It 
therefore provides opportunity for the 
postreinforcement-time cues to lose their 
tendency to evoke r, (because on some 
of the trials these cues occur without 
interaction by a small reward and 
without a later reward as an UCS), and 
result in increased 
extinction. 


resistance to 


Variep Detay Experiment II 


Of the predictions yielded by the 
above hypothesis, the one chosen for 
test involves an examination of the 
differences between a varied delay 
experiment reported by Crum, Brown, 
and Bitterman (1), in which varied 
delay produced an increased resistance 
to extinction, and Varied Delay Exp. 
I which failed to yield this result. 
There were three notable differences: 
(a) Their procedure involved massed 


* This hypothesis should not be confused with 
an assumption that the stimulus change pro- 
duced by reward preserves learning prior to 
reward. If one chose to add the assumption 
that the occurrence of a nonrewarded 1, produces 
frustration, then the present hypothesis could 
be recast to describe extinction as a function of 
the frustration produced by nonreinforcement, 
with certain learning conditions providing ex- 
tinction of this frustration. This added as- 
sumption does not appear to add anything to the 
particular problem here considered, and is there- 
fore not introduced on the grounds of parsimony. 
Such an interpretation, however, may be con 
trasted with an hypothesis advanced by Wilson, 
Weiss, and Amsel (14) in which the frustration 
produced by nonreinforcement is presumed to 
increase during partial reinforcement training, 
to persist, and, as an associative variable, to 
bridge the time gap for application of the Hull- 
V. Sheffield kind of analysis. 


training trials whereas we used widely 
distributed practice. (b) Their Ss 
received reward immediately on half 
of the trials whereas ours waited | 
sec. on these trials. (c) Their Ss were 
delayed 30 sec. on long delay trials, 
whereas ours waited only 9 sec. 

Should a constant zero-delay group 
show the same rate of extinction as 
a 0-9 sec. varied delay group, using 
massed trials, we could conclude that 
varied delay does not always produce 
an increase in resistance to extinction 
even with massed practice and with 
an immediate reward on half of the 
trials. Such a finding would be con- 
sistent with the above r, hypothesis, 
and would rule out the first two 
differences noted between previously 
reported results. However, a group 
with a large difference (0-30 sec.) in 
delays should, according to hypothe- 
sis, show a reduced rate of extinc 
tion. 


Method 


Apparatus and procedure were identical with 
the three earlier experiments except that massed 
training (15-sec. intertrial interval) was em 
ployed. The design involved three groups (all 
Ss received a constant |-pellet magnitude): (1) 
O-sec. constant delay group (N = 5); (2) 0-9 
sec. varied delay group (N = 5); (3) 0-30 sec. 
varied delay group (N = 5). 
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Fic. 2. Total running speeds during 
extinction for Varied Delay Exp. IL. 
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Results and Discussion 


The results are shown in Fig. 2. 
(Since some Ss did not continue 
running on all trials with massed 
extinction after the first two days, no 
reliable index of extinction asymptote 
could be obtained and raw speed 
curves are therefore shown.) Analy- 
sis of variance on the second day of 
extinction yielded an F of 15.745 
which, with 2 and 12 df is significant 
at beyond the .0O1 level. Subse- 
quent ¢ tests show that the 0-30 sec. 
varied delay group was performing 
significantly (P < .0OO1) better than 
either of the other two groups, which 
were not significantly different from 
each other. 

The predictions based on the rf, 
analysis were clearly confirmed. 


SUMMARY 


This article reports a series of experiments in 
which were separately observed the effects of 
varied delay and varied magnitude of rein- 
forcement upon resistance to experimental 
extinction, The results of these experiments 
suggest: 

1. When delay of reinforcement is varied 
between two equally likely values, under either 
massed or widely distributed practice, rate of 
extinction is decreased only if the difference in 
delays is fairly large. Specifically, variation 
between 0- and 30-sec. delay increased resistance 
to extinction whereas variation between 0 and 9 
sec. and between | and 9 sec. did not. 

2. When magnitude of reinforcement is varied 
between two equally likely values, under widely 
distributed practice, the rate of extinction may 
be decreased. Specifically, variation between 
1 and 9 pellets increased resistance to extinction 
to an extent comparable with partial reinforce- 
ment (variation between 0 and 9 pellets). 

3. An hypothesis was advanced to account 
for these and related findings. Resistance to 
extinction is assumed to be increased by any 
training procedure which permits extinction of 
the fractional anticipatory goal response to the 
cues which occur on nonreinforced trials after the 
regular time of reinforcement. 
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REDUCTION OF SECONDARY REWARD VALUE AS 
FUNCTION OF DRIVE STRENGTH DURING 
LATENT EXTINCTION ! 


HOWARD MOLTZ AND SALVATORE R. MADDI 


Brooklyn College 


Several investigators (2, 6, 7, 9) 
have recently reported that certain 
pre-extinction procedures will weaken 
the strength of a previously learned 
response without that response itself 
being performed. These pre-extinc- 
tion procedures were designed to 
“inform” S that food was no longer 
present in the previously baited goal 
box. This was accomplished by 
placing S directly in the empty goal 
box for a period judged sufficient to 
allow him to perceive its present non- 
rewarding character. The effect was 
observed when S was subsequently 
run to extinction and its performance 


compared with that of a control group 


that did not receive pre-extinction 
training. Some investigators (1, 8) 
have not found pre-extinction training 
to be effective in producing extinction. 
Nevertheless, it seems that, in some 
situations at least, performance is not 
a necessary condition for extinction. 
An expectancy theory encounters 
no difficulty in providing an inter- 
pretation. ‘Tolman has stated that 
“The CR is a response to the sign 
relationship S.-+S;. And .. . [the 
response | goes off only so long as that 
sign relationship is ‘believed’ in by O” 
(12, p. 330). Thus, permitting S 
ample opportunity to observe that 
the reward object is no longer present 
in the goal location should be suffh- 
cient to weaken the expectancy of 
! This article is based on the master’s thesis 
of the second author conducted under the super 
vision of the first author. The first author is 


responsible for the design and for the manuscript 
in its present form. 


finding the reward object there again, 
so that subsequently S should require 
fewer trials to reach an extinction 
criterion. 

Hull’s theory of extinction, on the 
other hand, encounter 
difficulty in providing an _ interpre- 
tation, since reactive inhibition and 
conditioned inhibition could not de- 
velop unless the response, or some 
component of the response, was 
performed. However, in a_ recent 
study Moltz (6) has contended that 
while Hull’s response-induced inhi- 
bition theory requires the performance 
of the to-be-extinguished response, 
Hull’s general behavior theory makes 
no such requirement. In line with 
this contention Moltz advanced the 
hypothesis that, following training in 
a T maze, if S were placed directly in 
the goal box without the presence of 
the reward object (a procedure which 
he called “latent extinction’’) this 
would reduce or eliminate the second- 
ary reward value of cues present in 
the goal box, and that this would, in 
turn, lead to extinction through non- 


appears to 


reinforcement or a_ decrease in 
Analysis of the data 


revealed latent extinction to be effec- 


reinforcement. 


tive only when it resulted in a sig- 


nificant decrease in the secondary 
reward value of goal-box cues. 


Hull (3) the 


acquisition of secondary reward prop- 


has suggested that 


erties by a neutral stimulus is de- 
pendent upon that stimulus having 
conditioned to it fractional 
component of the goal response (r,) 


some 
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and, in addition, that the stimulus 
will retain its secondary reward value 
as long as it retains the power to 
evoke r,.2 On this basis it would 
appear reasonable to expect that any 
procedure successful in reducing the 
response strength of r, during latent 
extinction should result in the re- 
duction or elimination of the second- 
ary reward value of goal-box cues. 
Furthermore, the greater the reduc- 
tion in the response strength of 1,, the 
greater should be the reduction of 
secondary reward value. The ques- 
tion arises as to the type of experi- 
mental procedure that might be 
expected to effect a reduction in the 
response strength of r, during latent 
extinction. 

Hull (3), Spence (11), and others 
(5) have conceived the fractional 
anticipatory goal response to be a 
theoretical construct whose relation 
to a specified set of observable events 
follows the same behavioral laws as 
such molar responses as bar-pressing 
and salivation. Therefore, all those 
operations that are assumed to effect 
a reduction in the strength of a molar 
response should also effect a reduction 
in the strength of r,. If this be the 
case, then it appears reasonable to 
expect that the greater the number of 
nonreinforced evocations of r, during 
latent extinction the greater will be 
the reduction in response strength. 
In addition, since it has been shown 
that increasing the effective drive 
level will increase the rate of evocation 
of a molar response, it would also 
appear reasonable to expect that the 
higher the drive level during latent 


* This suggestion is based on the assumption 
that the characteristic interoceptive stimulation 
(sg) produced by r, comes to acquire secondary 
reward properties as a consequence of its close 
association with primary reward. Any neutral 
stimulus to which rz subsequently becomes con- 
ditioned will, in turn, acquire reward value 
through the arousal of s,. 


extinction the greater will be the 
number of nonreinforced evocations 
of r, that will occur during this 
period.* With these assumptions in 
mind concerning the relation between 
drive level, r, evocation, and second- 
ary reward value, the hypothesis of 
the present experiment may now be 
stated as follows. The higher the 
drive level during latent extinction 
the greater will be the reduction in 
the secondary reward value of cues 
present in the goal box as indexed by 
their capacity to mediate the learning 
of a new response in the absence of 
primary reinforcement. If it can be 
shown that these cues had previously 
acquired secondary reward value, 
then the condition necessary for 
testing the present experimental hy- 
pothesis would be satisfied. 


Metuop 


Subjects and apparatus.—The Ss were @O 
experimentally naive, male albino rats of the 
Wistar strain. The ages of the Ss ranged from 
100 to 150 days at the beginning of the experi- 
ment. 

The apparatus was a single-unit T maze and 
a straight alley. ‘The starting box, arms, and 
stem of the maze were painted gray. The stem 
of the maze was 21 in. long and 5 in. wide. 
There was a distance of 46 in. from the outer 
end of the left goal box to the outer end of the 
right goal box. The roof of the maze was 
covered with .25-in. mesh hardware cloth. 
Gray cloth curtains were placed in front of the 
doors to the goal boxes. Guillotine doors of 
light sheet metal were operated by £. The 
straight alley was 24 in. long and 5 in. wide and 
was also covered with .25-in. mesh hardware 
cloth and painted gray. Four goal boxes were 
used, two of which were painted black and two 
of which were painted with black and white 
stripes. The goal boxes were identical in all 
other respects. ‘The same goal boxes were used 
on both the maze and the straight alley. In 


*When applied to molar responses, these 
assumptions readily follow from Hull's postulate 
set (3). Also, Skinner (10) has presented 
empirical evidence supporting these assumptions 
with regard to bar-pressing of rats and Jenkins 
and Daugherty (4) with regard to the pecking 
response of pigeons. 
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order to be certain that all traces of food were 
eliminated on nonreinforced trials, the same 
black or striped goal box was always used during 
these trials, while another black or striped goal 
box was always used during the reinforced trials. 

Several layers of cheesecloth completely sur- 
rounded the maze and the straight alley. The 
cheesecloth was attached to an unpainted 
wooden frame, 4 ft. wide and 3 ft. high. The 
frame was nailed to the table on which the 
apparatus was placed. The use of the cheese- 
cloth was intended to eliminate as many extra- 
maze cues as possible and to provide a nearly 
uniform perceptual field. The only source of 
light was a 25-w. bulb placed 3.5 ft. above the 
apparatus. 

Habituation.—Prior to training, all Ss were 
given five days of habituation to the feeding 
regimen of the experiment. During these five 
days, Ss were fed an unlimited amount of wet 
mash once daily for 1 hr. 

Straight-alley training.—Beginning on Day 6, 
Ss were given 10 trials a day for four days in the 
straight alley. All Ss were under a 22-hr. 
hunger drive at the beginning of each day’s run. 
On seven of the 10 trials of each day, S found 
Purina Checkers scattered over the floor of the 
goal box. On the remaining three trials of each 
day, S ran to an empty goal box. The sequence 
of reinforced and nonreinforced runs was deter- 
mined randomly. ‘The intertrial interval was 
always at least 10 min. with a modal interval of 
12 min. All Ss remained in the goal box for 45 
sec. following either a rewarded or nonrewarded 
run. One-half of the Ss ran to a black goal box 
on rewarded trials and to a striped goal box on 
nonrewarded trials; the remaining one-half ran 
to the striped and black goal boxes on rewarded 
and nonrewarded trials, respectively. 

Latent extinction period.—Twenty-two hours 
after the last straight-alley trial, all Ss were 
assigned at random to two groups. The Ss of 
the experimental group were placed directly in 
the previously positive goal box (either black or 
striped) for four 1-min. periods without the 
presence of food (latent extinction). Each 
period was separated by approximately 10 miri. 
The Ss of the control group were not placed in 
the goal box during the latent extinction period 
but were treated like the experimental Ss in all 
other respects. 

The experimental and control Ss were further 
divided into three additional subgroups. Each 
subgroup consisted of 10 Ss. The Ss of one 
subgroup (O-hr. group) were maintained on an 
unlimited amount of food beginning with the 
termination of straight-alley training until the 
The 
Ss of the second subgroup (22-hr. group) were 
maintained on the 22-hr. feeding schedule used 


conclusion of the latent extinction period. 


during straight-alley training. The Ss of the 
remaining subgroup (44-hr. group) were not 
given their usual feeding following the last 
straight-alley trial so that at the beginning of the 
latent extinction period they had not been fed 
for approximately 44 hr. Thus, the experi- 
mental Ss were given latent extinction under 
either 0, 22, or 44 hr. of hunger. The control 
Ss were also made differentially hungry during 
the latent extinction period but were not, of 
course, given latent extinction. Immediately 
following the latent extinction period, all Ss 
were given an unlimited amount of wet mash for 
l hr. Thereafter, food was withdrawn so that 
at the beginning of the test trials on the following 
day each S was again under a 22-hr. hunger drive. 

Test trials.—The test trials were begun about 
22 hr. after the last feeding period. They con 
sisted of 15 free choices on the T maze. Food 
was not present in the maze during these trials. 
With the exception of the first test trial, one arm 
of the maze contained the previously positive 
goal box (either black or striped), while the other 
arm of the maze contained the previously 
negative goal box. On the first test trial, one 
half of the Ss found the previously positive goal 
box whichever way they turned, while the 
remaining Ss found the previously negative goal 
box on whichever side they chose. This pro 
cedure was designed to assure a 50% base line 
on the initial test trial. 

During testing, the intertrial interval was 
approximately 10 min. Correction or retracing 
after a response was not permitted, All Ss 
remained in the previously positive or previously 
negative goal box for 10 sec. following a choice. 

For convenience, a response to the side of the 
maze containing the previously positive goal 
box will be referred to as a “correct response” ; 
a response to the side of the maze containing the 
previously negative goal 
response.” 


box, an “incorrect 


Resutts 


In order to test the present hy- 


pothesis, it was first necessary to 
demonstrate that the procedure em- 
ployed during straight-alley training 
was adequate to establish the color 
of the positive goal box as a secondary 
reinforcer. A test of the adequacy 
of the training procedure in this 
respect was accomplished by com- 
paring the mean number of correct 
responses made by the control Ss 
during the test trials against chance 


expectancy (7.50). It can be seen 
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TABLE 1 


Means anv SDs or tHe Number or Correct 
Responses ror Eacn Group 


Experimental Control 
Ses Se 
Conditions 


Mean | SD |Mean| SD 


9.30 | 1.25 |9.30| .95 
7.10 | 99 | 9.9) | 1.37 
6.30 | 1.70 | 9.80} .79 
7.0 | 2.26 | 9.87) 1.13 
7.53 | 1.36 |947| .99 


0-hr. hunger 
22-hr. hunger 
44-hr. hunger 
Black goal box 
Striped goal box 








from Table | that the mean number 
of correct responses for control Ss 
that were under either a O0-, 22-, or 
44-hr. hunger drive during the latent 
extinction period was 9.30, 9.90, and 
9.80, respectively. Each of these 
values exceeds chance expectancy at 
the .O1 level. There can be little 
doubt that the training procedure 
employed in the present experiment 
was adequate to establish the color 
of the positive goal box as a secondary 
reinforcer. 

In Tables 2 and 3 are summaries of 
the analysis of variance of the number 
of correct responses for the experi- 
mental and control Ss combined, and 
for the experimental Ss separately. 
The analysis of variance for the 
control Ss separately is not shown 
since neither the main effects nor the 


TABLE 2 


Summary or ANALYSIS OF VARIANCE OF THE 
Number or Correct Responses 
ror Aut Ss 


Source df 


Experimental, 
Yontrol (EF) 
Drive (D) 


SrwN KN ew 


— 


Within cells 








SP < A. 


TABLE 3 


ANaAtysis or VARIANCE OF NumBer or Correct 
Resronses or Experimentat Ss Onry 


df MS | F 


Source 
Drive | 2 | 24.13 | 12.70° 
Color | l 04 | 
Drive X Color 

Within cells 


| 1.73 | 
24 | «19 | 


| 


*P <M. 


interaction were significant. The 
trial-by-trial percentages of correct 
responses for all experimental Ss and 
for all control Ss are shown in Table 
4. 

Before testing the present experi- 
mental hypothesis it is also necessary 
to demonstrate that the latent ex- 
tinction procedure was effective in 
reducing the secondary reward value 
of the color of the positive goal box. 
The mean number of correct responses 
of the experimental Ss was 7.57 as 
compared with 9.67 for the control 
Ss (P < 01, Table 2). The mean 
number of correct choices of the 
experimental Ss did not differ sig- 
nificantly from chance expectancy 
(t = .21; P > .80), while the mean 
number of correct choices of the 
control Ss was significantly in excess 
of chance expectancy (t = 11.42; 
P< Ol). It appears reasonable to 
conclude that latent extinction was 
effective in reducing the secondary 
reward value of the color of the 
positive goal box as evidenced by the 
failure of the latter subsequently to 
mediate the learning of a new 
response. Whether S was rewarded 
in the black goal box or in the striped 
goal box during straight-alley training 
did not differentially affect the num- 
ber of correct responses made during 
the test trials. 

As shown in Table 1, the mean 
number of correct choices made 
during the test trials for experimen- 
tal Ss given latent extinction under 
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TABLE 4 


Percentace or Correct Responses 


Groups 


Experimental 
Control 


0-, 22- or 44-hr. hunger drive was 
9.30, 7.10, and 6.30, respectively (P 
< 01, Table 3). The values of ¢ 
obtained by comparing the 0-, 22-, 
and 44-hr. experimental subgroups 
indicated that the O-hr. subgroup 
made a significantly greater mean 
number of correct choices than either 
the 22-hr. subgroup (t = 4.35; P 
< .O1) or the 44-hr. subgroup (t= 
4.49; P < Ol). While the difference 
between the 22-hr. and the 44-hr. 
experimental subgroups was in the 
expected direction, the value of ¢ 
obtained was not. significant § (t 
= 1.28; .20 < P < .30).* 


Before the experimental hypothesis 
is confirmed, it is necessary to ex- 
amine the number of correct responses 
made by the control Ss during the 


test trials. An analysis of variance 
(not shown) of the number of correct 
responses for the control Ss separately 
indicated that the variance between 
drive was smaller than the 
within-cells variance. Since, in addi- 
tion, the interaction between expeéri- 
mental the one 
hand, and different drive levels on the 
other, was significant (F = 10.99; 
P < .O1), it is concluded that differ- 
ences in drive level during the latent 
extinction period did not significantly 
affect the number of correct choices 


levels 


and control Ss on 


*The mean of 6.30, obtained by the 44-hr. 
subgroup is significantly than 
expectancy (P < .05). However, this 
appears to have been unduly influenced by one 
When this score is dis 
regarded, the mean becomes 6.70, which does 


lower chance 


value 
low extreme 


score 


not differ significantly from chance. 


of the control Ss. The results support 
the hypothesis that the higher the 
drive level during latent extinction, 
the greater the reduction in the 
secondary reward value of goal-box 
cues. 


Discusston 


The present experimental hypothesis 
was generated from a consideration of the 
fractional anticipatory response mecha 
nism. Hull has suggested that a stimulus 
that has acquired reward 
value will retain that value as long as it 
retains the power to evoke some frac 
tional component of the goal response 
(r,). 


secondary 


On this basis, it was assumed that 
a procedure designed to effect a greater 
reduction in the response strength of r, 
during latent extinction should result in 
a greater reduction in secondary reward. 
It was reasoned that (a) the greater the 
number of nonreinforced evocations of 
1, during latent extinction the greater 
will be the reduction in its response 
strength, and (4) the higher the drive 
level during latent extinction the greater 
will be the nonreinforced 
evocations of r, that will occur during 
this period. 


number of 


It was found, in conformity 
to expectation, that the higher the drive 
level during latent extinction the greater 
the reduction in secondary reward value 
as indexed by the capacity of cues that 
had previously acquired this value to 
mediate the learning of a new response. 

The present theoretical analysis sug 
gests an interpretation of latent extinc 
tion that is consistent S-R 
reinforcement approach. This analysis 
does not require that a response be 
performed in order for the extinction of 
that response to occur. But while the 
latent extinction phenomenon can be 


with an 
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deduced from Hull’s general behavior 
theory, it is clear that his response- 
induced inhibition theory is not adequate 
to mediate this deduction since it makes 
performance a necessary condition for 
extinction. This suggests that the re- 
sponse-induced inhibition theory is either 
inadequate to provide an explanation of 
extinction or that latent extinction 
represents a more complex phenomenon 
than that to which Hull applied his 
inhibition theory. But if it is main- 
tained that the inhibition theory is 
applicable to one class of response- 
elimination phenomena but is not appli- 
cable to response-elimination phenomena 
of another class involving more complex 
behavior mechanisms, then it is necessary 
to specify the parameters that distin- 
guish these classes. Until such specifi- 
cation is achieved, the inhibition theory 
must remain suspect as an explanation 
of extinction—whether that extinction 
is assumed to be of the “simple” or 
“complex” variety. 


SUMMARY 


Sixty male albino rats were given 10 trials a 
day for four days in a straight alley while under 


a 22-hr. hunger drive. The Ss were rewarded 
on seven of the 10 trials of each day but were 
not rewarded on the remaining three trials. 
One-half of the Ss ran to a black goal box on 
rewarded trials and to a striped goal box on 
nonrewarded trials; the remaining one-half ran 
to the striped and black goal boxes on rewarded 
and nonrewarded trials, respectively. 

About 22 hr. after the last straight-alley 
trial, the experimental Ss were placed in the 
previously positive goal box without the presence 
of food for four 1-min. periods (latent extinction). 
The control Ss were not given latent extinction. 
The experimental and control Ss were divided 
into three additional subgroups. The sub- 
groups were under either a O-, 22-, or 44-hr. 
hunger drive at the beginning of the latent 
extinction period. On the day following the 
latent extinction period the test trials were 
begun. These trials consisted of 15 free choices 
on a T maze, One arm of the maze contained 
the previously positive goal box, while the other 
arm of the maze contained the previously 
negative goal box. All Ss were again under a 
22-hr. hunger drive. Food was not present in 
the maze during the test trials. 


The results supported the hypothesis that the 
higher the drive level during latent extinction 
the greater will be the reduction in secondary 
reward value as indexed by the capacity of cues 
that had previously acquired this value to 
mediate the learning of a new response. The 
relation between the present experimental 
hypothesis and the fractional anticipatory 
response mechanism was discussed. 
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AVOIDANCE CONDITIONING 
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Department of Clinical Science, School of Medicine, University of Pittsburgh 


Several theoretical positions predict 
that the extinction of a conditioned 
response will proceed more rapidly 
with massed trials than with spaced 
trials. Such a prediction stems di- 
rectly from Paylov’s (12) internal 
inhibition theory, Mowrer’s (8) fa- 
tigue theory, Hull’s (4) reactive 
inhibition theory, and Miller and 
Dollard’s (7) fatigue theory of extinc- 
tion. A review of the literature 


indicates less agreement as to the 
empirical result in any given situation. 
Thus, most earlier investigators (2, 
12, 13) have concluded that extinction 
is faster when trials are massed than 
when they are spaced. 


V. F. Shef- 
field (15) found directly opposite 
results with rats trained to run down 
an alley to obtain food. Animals 
extinguished under the massed con- 
dition (one trial every 15 sec.) were 
found to extinguish less rapidly than 
animals extinguished under a spaced 
condition (one trial every 15 min.). 
After a critical review of the earlier 
studies (2, 12, 13) she suggested 
rather convincingly that the evidence 
for their conclusions was rather 
tenuous. 

Numerous studies have demon- 
strated the methodological signifi- 
cance of any change in procedure 
occurring concomitantly with the 


' This investigation was supported by a grant 
(M-487C2) from the National Institute of 
Mental Health, Institutes of Health, Public 
Health Service. 

2 The authors are indebted to Dr. L. Arthur 
Mirsky for suggestions and criticisms and to 
Mr. Edward Brown for aid in collecting the data. 
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instigation of extinction. In this 
respect avoidance training affords 
unique advantages over other types 
of conditioning. In the avoidance 
Situation no objective change occurs 
between conditioning and extinction. 
This is in contrast to other types of 
conditioning in which the presence or 
absence of the UCS is immediately 
apparent. However, in order to util- 
ize this advantage of avoidance 
training it is necessary to continue 
conditioning to a high degree. Avoid- 
ance conditioning differs from classical 
conditioning in that the UCS is not 
presented if the animal performs the 
conditioned response. Classical and 
instrumental conditioning are thus 
recognized as different phenomena, 
i.e., experimental results are restricted 
to the type of conditioning employed. 
However, instrumental avoidance 
learning is identical to classical con- 
ditioning during the early phase of 
learning. It is only after the animal 
learns to avoid the painful stimulus 
that the procedures diverge. In other 
words, avoidance conditioning per se 
begins with the first failure to get 
shocked, not with the first shock. 
The present study represents an 
attempt to develop a methodological 
procedure which be used to 
establish an avoidance response to a 
high degree without requiring a dis- 
proportionate amount of time. 

More specifically, the present study 
was undertaken to determine: (a) the 
relative effect of massed and spaced 
trials on resistance to extinction, and, 


can 
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(b) the methodological importance of 
greatly reducing the total amount of 
time required to condition and ex- 
tinguish an avoidance response. 


MetTuop 


from the 
These Ss 
of the 


Subjects.—Sixty rats obtained 
Sprague Dawley farms served as Ss. 
were 50-60 days old at the start 
experiment. 

Apparatus.—The apparatus consisted of a 
modified Warner (18) two-compartment con- 
ditioning box. The top and both ends were 
painted white with one side of the conditioning 
compartment formed by a sheet of milk glass. 
The remaining side consisted of a specially 
constructed one-way vision screen. Constant 
illumination was provided by a 25-w. bulb 
located at each end of the apparatus behind the 
translucent milk glass. A 2-in. barrier of }-in. 
stainless-steel rods separated the two compart- 
ments. ‘The top bar of the barrier was a }-in. 
stainless-steel rod mounted on bearings to reduce 
perching. An agastat time-delay relay was 
employed for automatic presentation of the 
shock stimulus (110 vy. with 85,000 ohms in 
series with S). Additional details of the ap- 
paratus have been reported previously (9). 

Procedure.—The conditioning trials for each 
daily test session were preceded by a 2-min. 
adaptation period followed by the presentation 
of a buzzer CS. The shock UCS was auto- 
matically delivered after 5 sec. if S failed to 
make the barrier-crossing response to the buzzer. 
The buzzer, or buzzer and shock, were termi- 
nated manually when S responded. 

The 6 Ss were randomly assigned to one of 
three groups with the following experimental 
conditions: in Group I, Ss were conditioned with 
an average intertrial interval of 20 sec. for each 
test period; in Group II, the intertrial interval 
for the conditioning and extinction trials aver- 
aged 40 sec. for each test period; in Group III, 
Ss were given three test periods with an average 
intertrial interval of 60 sec., three test periods 
with an average intertrial interval of 50 sec., 
three test periods with a 40-sec. average, three 
with a 30-sec. average, and three final condi- 
tioning test periods with an average interval of 
20 sec. Extinction for this group was completed 
with an average intertrial interval of 20 sec. for 
each test period. When the intertrial intervals 
during conditioning are totaled for the three 
groups, Group I had an average of 20 sec. 
between trials and Groups II and III had an 
average of 40 sec. between trials. 

Ten conditioning trials were given daily for 
the 15-day conditioning period and the 15-day 


extinction phase. The study was completed in 
three replications. One animal in Group III 
died during the conditioning phase. 


RESULTS 


The percentage frequency of CRs 
during conditioning is presented 
graphically in Fig. 1. The number 
of CRs by Group I Ss was significantly 
lower than Group II (P < .01) and 
Group III (P < .O1) as tested with 
Fisher’s exact test of significance (1). 
Groups II and III were not signifi- 
cantly different during the condi- 
tioning phase. 

A criterion found to be highly 
satisfactory in previous studies is to 
exclude Ss failing to respond at or 
above the 70% level for three con- 
secutive days during conditioning. 
Sixteen Ss in Group I failed to meet 
this criterion and no extinction data 
were obtained for this group. Four 
Ss in Group III failed to meet this 
criterion and six Ss in Group II were 
also excluded from the extinction 
measure. The Group I procedure 
produced significantly more non- 
learners than Group II (P < .01) 
and Group III (P < .O1) as tested 
with the chi-square test. 

With the nonlearners excluded, no 
significant differences were found be- 
tween Groups II and III for the 
conditioning phase. Median CRs 


PER CENT FREQUENCY OF Cis 











Fic. 1. The percentage frequency of CRs 
during the conditioning phase for all Ss. 
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Fie. 2. 


The percentage frequency of CRs 
during extinction. 


were 104.5 for Group III and 99.0 
for Group II. Median CRs during 
extinction were 72.5 for Group II 
compared with 132.0 for Group III. 
The extinction data in terms of per- 
centage frequency of response are 


portrayed in Fig. 2. The Ss con- 


ditioned with progressively shorter 
intervals between trials responded at 
a higher rate throughout extinction. 
This difference was significant at .003 


with Fisher’s exact test (1). 


Discussion 


The present study was designed to 
provide basic data on the theoretical 
problem of the effect of massed vs. 
spaced trials on resistance to extinction. 
An additional aim attempt to 
solve the practical problem of greatly 
reducing the amount of time required to 
condition and extinguish an avoidance 
response. 

Although the spacing variable was 
confounded somewhat by conditioning 
Ss under different time schedules, it 
appears clear that massing of trials did 
not lead to more rapid extinction as 
suggested by several theoretical positions 
(4,7,8,12). The more surprising aspect 
is that the massing technique resulted in 
significantly increased resistance to ex- 
tinction. This finding is similar to that 
reported by V. F. Sheffield (15) and 
attributed to the increased motivational 
effect generated by the frustrating omis- 


was an 


sion of food during extinction. As 
Sheffield indicated, however, the frus 
tration-induced motivation concept can 
not be applied to situations in which 
noxious stimulation is omitted 
extinction. 


during 


The data appear to conform with an 
ingenious hypothesis originally proposed 
by F. D. Sheffield and Temmer (16) and 
elaborated by Schoenfeld (14). Shef 
field has suggested that an avoidance 
response persists for an unusually long 
time in comparison with other types of 
conditioning as a result of the training 
procedure. 
avoidance training § 
making any 
arbitrarily 


In the conditioning phase of 
is punished for 
other than the 
designated CR. The pat 
terns of response-produced stimuli ac 
companying responses other than the 
CR may, as Schoenfeld (14) suggests, 
acquire negative reinforce 
ment properties. During extinction the 
only response which does not give rise to 
the negative secondary reinforcement 
cues is the CR, and this response is 
highly resistant to extinction. This is 
really not surprising in that, as F. D. 
Sheffield notes, “Failure to respond in 
time is the only condition regularly 
reinforced, and all of the cue patterns 
signifying such failure are conditioned 
to the response in the course of avoidance 
training” (16, p. 288). 

In a previous study (10) we have 
accepted the Guthrie (3) and Wendt (19) 
hypothesis that extinction results from 
the during extinction § of 
responses which are incompatible with 
the conditioned response and lead to its 
eventual extinction. Applying the 
Guthrie-Wendt hypothesis in conjunc 
tion with the F. D. Sheffield hypothesis 
to the data of the experiment described 
herein provides an 
monious 


response 


secondary 


acquisition 


apparently 
interpretation. In Ss with a 
longer time interval between extinction 
trials a greater number of incompatible 
responses occurred, 


parsi 


These incompatible 
washing or 
grooming, sitting quietly, moving about 
the apparatus, etc. With a 
occurrence of incompatible responses, 
these Ss were given more opportunity 


responses included face 


greater 
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for the negative secondary character- 
istics of the cue patterns to extinguish. 
Accordingly, Ss extinguished under the 
massed condition would receive less 
opportunity for extinction of the negative 
cue patterns accompanying incompatible 
responses and would be more resistant to 
extinction. 

In addition, the punishment of in- 
compatible responses was considerably 
different between groups during con- 
ditioning. As the intertrial interval for 
the massed group became shorter, fewer 
incompatible responses occurred, with 
the result that a greater proportion were 
contiguous with the fear accompanying 
buzzer presentation of the pain accom- 
panying the presentation of shock. 

From a methodological viewpoint the 
utilization of progressively diminishing 
intervals during the conditioning phase 
has the distinct advantage of continuing 
conditioning well beyond the classical- 
instrumental dichotomy without re- 
quiring a disproportionate amount of 
time. Such a procedure appears to 
provide an alternative to the type of 
study (5, 6, 11) which employs a low 
criterion of avoidance training and then 
attempts to draw generalizations which 
apply to avoidance conditioning gener- 
ally. Although these studies (5, 6, 11) 
are empirically sound, it would appear 
desirable to continue avoidance training 
considerably past the point at which 
conditioning changes from a classical to 
an instrumental procedure. 

Animals conditioned under a constant 
20-sec. interval (Group I) were included 
in the study to anticipate the possibility 
that the advantages of the diminishing 
interval procedure could be obtained by 
starting Ss directly on a 20-sec. schedule. 
The failure of Ss learning under the 
Group I procedure to acquire the re- 
sponse was apparently attributable to 
the close temporal relationship of the 
CS with the UCS of the preceding trial. 
After Ss had acquired the response, as in 
Group III, the UCS was seldom present 
and this factor played a comparatively 
minor role. 

The technique of progressively short- 
ening the intertrial interval as condi- 





tioning proceeds appears to have two 
advantages: (a) conditioning is en- 
hanced; and (4) observations may be 
made on a much larger number of Ss. 


SUMMARY 


A conditioned avoidance response was de- 
veloped in three groups of rats with a modified 
two-compartment apparatus. Progressively re- 
ducing the intertrial interval by a total of 67% 
distributed over the course of conditioning did 
not significantly alter the acquisition of the 
response. However, when Ss were maintained 
throughout the course of conditioning on the 
reduced intertrial interval, learning was sig- 
nificantly retarded. 

Animals conditioned under the progressively 
diminished interval procedure (intertrial inter- 
vals starting at 60 sec. and diminishing to 20 
sec. during conditioning with extinction at a 
constant 20-sec. interval) were significantly more 
resistant to extinction than Ss maintained on an 
average 40 sec. intertrial interval throughout 
conditioning and extinction. That is to say, 
progressively massing trials during conditioning 
with massed trials throughout extinction re- 
sulted in significantly increased resistance to 
extinction. It was noted that this finding is 
contrary to predictions from fatigue theories of 
extinction and an alternative interpretation was 
presented. 
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DRIVE LEVEL AS A FACTOR IN DISTRIBUTION OF 
RESPONSES IN FIXED-INTERVAL 
REINFORCEMENT 
BERNARD WEISS AND EDWIN W. MOORE 
USAF School of Aviation Medicine 


One of the ways in which reinforce- 
ment may be correlated with behavior 
is to make the former available as a 
periodic function of time. Practice 
on such a schedule in the Skinner box 
eventually leads to characteristic devi- 
ations from a constant rate of re- 
sponding. The most prominent of 
these appear as depressions in rate 
directly following the delivery of a 
reinforcement and as subsequent in- 
creases later in the interval. 

In his earlier formulation, derived 
from rat performance, Skinner (6) 
proposed that these deviations re- 
flected a temporal discrimination 
arising from two circumstances: (a) 
responses which directly follow rein- 
forcement are never reinforced; (b) 
responses which follow close upon 
other responses are seldom reinforced. 
The increase in rate which followed 
the postreinforcement depression was 
described as a compensatory effect. 
A more recent statement (7), utilizing 
data from the pigeon, attributes the 
depression in rate to a discrimination 
which is based on the cues produced 
during ingestion and asserts that once 
responding is resumed it is emitted at 
a constant rate. The last formu- 
lation vitiates the temporal discrimi- 
nation properties of the phenomenon. 

It has been found, however, that 
viewing these deviations as a temporal 
discrimination provides an interesting 


and useful index of the internal 
chemical state of an animal. A 
recent study (8) showed that the 


acuity with which the discrimination 
was performed, i.e., the degree to 
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which responses tended to be con- 
centrated toward the end of the inter- 
reinforcement interval, could be used 
as an index of drug-induced impair- 
ment. While there did not appear 
to be a relationship between the 
amount of impairment and appetite 
(as measured by food consumption), 
an experiment was undertaken to 
obtain more precise information on 
this question.' 


Metuop 


Sixteen male albino rats with experience in 
the Skinner box were used as Ss. After 250 
min. of training on a fixed-interval reinforcement 
schedule (i: = 3 min.), Ss were randomly as- 
signed to two groups, with eight Ss per group. 
One group was put on a 22-hr. food deprivation 
program, the other on a 5-hr. deprivation pro- 
gram. Both groups were then given 150 min., 
spaced over five days, of further practice on the 
3-min. fixed-interval schedule. ‘The Ss were fed 
for 1 hr. daily. 

The response distributions were obtained by 
projecting the records and counting the number 
of responses in each of the ten 18-sec. inter- 
reinforcement subintervals of the last five 3-min. 
intervals available from the 150-min. period. 


RESULTS 


The data are shown graphically in 
Fig. 1. It can be seen that increased 
drive level produces an increase in 
response rate, the result typically 
obtained. It also appears that the 
number of responses per subinterval 


'Sidman (5) has also found temporal dis- 
crimination to be a useful index of drug effects. 
His data, moreover, as do those reported in the 
present paper, suggest that there is no necessary 
relation between response rate and acuity of 
discrimination, 
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Fic. 1. Distribution of responses between 
reinforcements and least squares lines of best 
fit for both drive levels. 


tends to rise as the end of the 3-min. 
interval is approached. Both of these 
impressions are supported by the 
results of the analysis of variance in 
Table 1. 

The F ratio for Interval K Drive 
interaction indicates that the distri- 
bution of responses between rein- 
forcements is not significantly affected 
by drive level. To determine the 
shape of this distribution, the con- 
tributions of the first- to the fifth- 
order components were obtained by 


TABLE 1 


Anatysis or VARIANCE oF THE RESPONSE 
Distrisution Data 


Source ‘ P 


Drive Levels | 636.00 
Ss Alike | 39.48 | 

Intervals | 40.29 
Linear 304.03 | 
Quadratic 
Cubic 
4th Order 
Sth Order 


16,01°* 


10.41°* 
78.56°* 





means of the tables of orthogonal 
polynomials in Fisher and Yates (2). 
The results, as shown in Table 1, 
demonstrate that the preponderant 
contribution is made by the linear 
component. However, a significant 
increment in precision is afforded by 
the third-order component, indicating 
that a cubic equation provides the 
best fit of the function. 


Discussion 


These results impose two conclusions 
about the character of performance in 
fixed-interval reinforcement under the 
conditions of the present experiment. 

The primary effect of drive level is on 
rate of responding. When increased, the 
principal result is to impel the organism 
to sample the environment more fre- 
quently; i.e., it uniformly lowers the 
threshold of response evocation through 
time. Moreover, if the distribution of 
responses is viewed as a temporal dis- 
crimination, then it can also be said that 
the acuity of this discrimination is un- 
affected by drive. Such a finding might 
be taken as a demonstration that the 
“quality” of performance is in some 
measure independent of drive level and 
appears related to the evidence that 
increased drive in maze learning pro 
duces a reduction in time but not in 
error scores (e.g., 3). It is most closely 
linked to the study by Dinsmoor (1). 
Here, a discrimination situation was 
arranged so that reinforcement was 
available in the presence of a light (8?) 
but was withheld during darkness (5°). 
Under these conditions, variation in 
drive level (body-weight loss) led to 
changes only in the over-all response 
rate, and not in the ratio of 8? to S* 
responses. 

Response rate accelerates from the 
beginning to the end of the interreinforce 
ment period. This contradicts Skinner's 
statement, based on pigeon data, that 
once the period of no responding which 
follows a reinforcement is ended, the 
organism thereafter responds at a con- 
stant rate. It is possible that the 
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contradiction arises from an interspecies 
difference which allows the rat to utilize 
cues correlated with time that are not 
available to the pigeon. For, when the 
pigeon is furnished with an external cue 
which corresponds to the passage of 
intra-interval time, the records (7, Fig. 
4) then display the acceleration in rate 
found in the present study. This may 
indicate a deficiency in the pigeon’s 
“clock,” and suggests that distributions 
with even greater skewness may be 
produced by organisms phyletically su- 
perior to the rat. 

Another way of looking at the distri- 
butions is to plot the mean interresponse 
latencies for each period. This pro- 
cedure yields, in conformity with 
Mueller’s model and data (4), an ex- 
ponential function. The present results 
suggest that the exponential distribution 
of the interresponse latencies found by 
Mueller (who also used a 3-min. interval) 
may partly be due to the tendency to 
decrease these latencies as the end of the 
interval is approached, for Mueller 
assumes that responses are randomly 
distributed in time. 

The present findings, together with 
those obtained before (8), suggest that 
the distribution of responses with a 3- 
min, interval tends to be mainly linear 
over a wide range of practice, since the 
earlier study provided twelve hours 
rather than the approximately four hours 
used in this experiment. It might also 
be noted that in this previous study no 
significant increase in the precision of 
fit was contributed by the quadratic and 
higher terms. This is not to say that 
more extended practice and other inter- 
vals will result in distributions of the 
same character. But it would be sur- 
prising if the main finding, i.e., the 
increase in rate as the end of the interval 
is approached, were to be jeopardized. 

A final word is perhaps in order about 
the statistical approach to these data. 
Skinner (6, 7) maintains that statistical 
methods have little to offer to a science 
of behavior because they obliterate the 
principles which typify individual per- 
formance. Studies such as the present 








one, however, demonstrate that certain 
features of behavior are better appre- 
hended through the use of statistical 
methods and, in fact, may provide 
information not available from sheer 
inspection of the raw data. 


SUMMARY 


Sixteen albino rats were given 250 min. of 
training on a 3-min. fixed-interval reinforcement 
schedule in the Skinner box. Half were then 
subjected to a 22-hr. food deprivation program, 
the other half to a 5-hr. deprivation program. 
All Ss were then given 150 min. of additional 
practice. Each of the last five 3-min. intervals 
of this period were divided into ten 18-sec. sub- 
intervals and the number of responses in each 
subinterval counted. The results demonstrated 
that: (a) drive principally affects the rate of 
responding and does not alter the way in which 
responses are distributed in time between 
reinforcements; (b) response rate tends to in- 
crease from the beginning to the end of the 
interreinforcement period. 
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AN INVESTIGATION OF THE 
DEFINING DRIVE 


PARAMETERS 
(D) 


EUGENE EISMAN 


University of California, Riverside 


In a recent study (1) it was found 
that groups of rats trained under 22 
hr. and 4 hr. of food deprivation, in a 
black-white discrimination problem, 
did not differ from each other in trials, 
errors, or reinforcements to the cri- 
terion of learning. As a_ possible 
explanation certain aspects of the 
maintenance procedure were analyzed. 
It was shown that both groups were 
fed for 1 hr. during each 24-hr. period, 
although the 4-hr. group was run 4 hr. 
after feeding and the 22-hr. group, 22 
hr. after feeding. This suggested that 
for an adequate definition of drive (D) 
more than hours of deprivation at the 
time of measurement need be speci- 
fied. It was hypothesized that D is 
some function of at least two parame- 
ters, hours of deprivation at the time 
of measurement (h) and a second 
parameter corresponding to hours of 
deprivation during a unit of time 
(h,). It was further hypothesized 
that A is related to stomach distension 
and its effects would therefore reach 
a maximum when the stomach was 
completely empty. There is some 
evidence that the emptying time of a 
rat’s stomach is approximately 4 hr. 
(6). In the study mentioned, there- 
fore, the 4hr. and the 22-hr. groups 
were equated for both A, and the 
effects of Ah. Although the groups 
were run, respectively, 4 hr. and 22 
hr. after feeding, according to the 
present assumption, the component 
of D contributed by A was at or near 
its asymptote and, therefore, equal 
for both groups. Also, since both 
groups were deprived for 23 hr. out 
of each 24-hr. period, they were 
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equated for hy On this basis no 
difference should have been expected 
in any measure reflecting D. 

The present experiment was de- 
signed to determine whether the above 
ad hoc explanation had merit. It was 
hypothesized that groups of rats 
maintained under 47 hr. of deprivation 
out of 48 hr. (A; = 47), and run 4 hr., 
22 hr., and 45 hr. after feeding (A = 4, 
22, and 45 hr.), respectively, would 
not differ in learning a brightness 
discrimination. Two additional 
groups, one deprived for 24 hr. out 
of 48 hr. and run 22 hr. after feeding 
and a second, deprived for 5 hr. out 
of 48 and run 4 hr. later were also 
studied. It was, therefore, predicted 
that these groups would display 
slower learning than the previous 
groups and that the latter of the two 
would be poorer than the former. 

Using a binomial system of desig- 
nation, the first number indicating the 
parameter A, and the second the 
parameter h, the predictions may be 
summarized, conveniently, as follows: 
(47/45 47/22 47/4) > 24/22 
> 5/4. A set of contrary predic- 
tions, would be yielded, however, if 
the more traditional definition of 
drive, viz., hours of deprivation at the 
time of measurement, were consid- 
ered. These predictions, using the 
same system of designation, would be 
of the following order: 47/45 > (47 
22 = 24/22) > (47/4 = 5/4). 


Metuop 


The apparatus and general procedure used in 
this study were described in detail elsewhere (1). 
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Only the necessary, abbreviated details will be 
presented here. 

Subjects.—-The Ss were 45 naive, male, 
hooded rats between 120 and 150 days of age. 
They were divided, at random, into five groups 
of nine each. ‘Three Ss from one group and one 
from another died during the experiment leaving 
9, 9,9, 8, and 6 Ss per group, respectively. 

Apparatus.—The discrimination apparatus 
used in this experiment was adapted from Grice 
(4). Briefly, it consisted of a small, gray 
starting box opening into a gray, Y-shaped 
choice area from which S could enter either a 
black or white end box. The right-left place- 
ment of the black and white boxes was varied 
from trial to trial. 

Conditions.—The five groups, mentioned 
above, differed with respect to their maintenance 
schedule. ‘Three groups were fed for | hr. during 
each 48-hr. period (deprived for 47 hr. out of 
each 48-hr. period). They were run, 4 hr., 22 
hr., and 45 hr. after feeding, respectively. The 
fourth group was deprived for 24 hr. out of each 
48-hr. period and was run 22 hr. after feeding. 
The fifth group was deprived for 5 hr. out of 48 
hr. and run 4 hr. after feeding. The groups 
were designated as Groups 47/4, 47/22, 47/45, 
24/22, and 5/4, respectively, using the h,/h 
system. ‘Two weeks were allowed for adaptation 
to the maintenance schedule.! 

Adaptation.—After Ss were adapted to the 
maintenance schedule, adaptation to the ap 
paratus was begun. During this period the 
black-white boxes were replaced by essentially 
identical gray boxes. One-half hour of group 
exploration, five Ss at a time, was given for three 
experimental days. On the last day of group 
exploration the food cup in each box was filled 
with a quantity of food. During the remainder 
of the adaptation, Ss were placed in the ap 
paratus singly, and given, on respective days, 
2, 3, 6, 6, and 6 free trials to a gradually de- 


' Motivational conditions in this study were 
specified solely in terms of the deprivation 
schedule. A record of the weight loss by groups, 
perhaps, would have been of additional value. 
Such data were not obtained. It is not likely, 
however, that the various hs had any differential 
effect on eating behavior. Consequently, the 
various groups in which A; was 47 most probably 
sustained the same loss in weight. The A, = 24, 
as well as the 5/4, groups had food in their cages 
for many hours during each 48-hr. period and 
certainly ate more than the groups in which A, 
was 47. It is felt, therefore, that the com- 
parisons among the three A, = 47 groups and 
between them and the A; = 24 group did not 
suffer as a result of the absence of such weight- 
loss data. 


creasing amount of food. By the sixth adap- 
tation trial they were running to one pellet, as 
was to be the case during training. 

Training.—After Adaptation Trial 23, the 
black-white end boxes were substituted for the 
gray and only the white box was baited. This 
was done to correct for an initia] black preference 
noted in pilot work. The Ss were given six 
spaced trials per experimental day. On an 
incorrect run S was kept in the black box for 
approximately 20 sec. The criterion of learning 
was 14 out of 16 errorless trials. 


ResuLts 


The mean number of errors, rein- 
forcements, and trials to reach the 
criterion, for each group except Group 
5/4 are presented in Table 1. The 
reason for omitting this group will be 
given below. The over-all difference 
among groups on each measure was 
tested using a simple analysis of 
variance. Since over-all differences 
for reinforcements and trials were 
significant beyond the .O1 level, t tests 
were indicated. Group 24/22 differed 
from the others at beyond the .01 
level for reinforcements. It differed 
from Group 47/4, for trials, at beyond 
the .02 level and from Groups 47/22 
and 47/45 at beyond the .01 level. 
None of the differences between 
groups maintained under h, = 47 
were significant, regardless of the 
value of h. Over-all error differences 
were not significant, and therefore ts 
were not calculated. 

Of the 9 Ss in Group 5/4 (hy = 5 
and h = 4), 5 exceeded the latency 
criterion for exclusion from the experi- 
ment. The remaining Ss responded 
correctly on so few trials that training 
was terminated after Trial 32. If all 
Ss were to have run correctly from 
Trial 33 on, the mean trials to criterion 
could not have been less than 46 trials, 
well beyond the range of the other 
groups. This group then was con- 
sidered to have shown the poorest 
learning of all. 





PARAMETERS DEFINING DRIVE (D) 


TABLE 1 


Mean Errors, Retnrorcements, anp Trias To THe Criterion 


Errors | 


Group 


Mean 


11.12 
10.66 
10.16 
12.77 


In summary, the order of learning 
for the groups, with respect to trials 
and reinforcements to the criterion, 
was as follows: 47/4 = 47/22 = 47 
45 > 24/22 > 5/4. The Ss _ with 
high h, required fewer trials and rein- 
forcements to reach the same criterion 
than the lower h, groups. The differ- 
ences and lack of differences among 
groups were independent of the com- 
monly used parameter A. 

Although the groups did not differ 
with respect to mean errors to the 
criterion, there was a difference in the 
distribution of errors between the low 
h, and the high A, groups. In Table 
2 the mean number of errors per group 
for Trials 1-12 and 13-24 are shown. 
It may be seen that Group 24/22 
made fewer errors than the groups in 
which h, was 47 during Trials 1-12, 
but more errors during Trials 13-24. 
An analysis of variance of the differ- 
ences among the mean differences 
between Trials 1-12 and 13-24, gave 
F = 4.18 (df = 3 and 28; P < .05). 


TABLE 2 


12 anp 13-24 


ror Eacn Group. 


Errors on Trias 1 


Triale 1-12 Trials 13-24 
Group 


Mean 


47/4 7.50 
47/22 7.44 
47/45 7.50 
24/22 6.56 


Reinforcements 


Mean SD 
30.87 
29.55 
29.83 
37.77 


3.48 
4.27 
5.24 
6.27 


The intergroup differences between 
mean differences were tested with ¢. 
The differences between Group 24/22 
and each of the A, = 47 groups was 
significant at beyond the .02 level. 
None of the differences between any 
pair of groups in which A; was 47 was 
significant, with P ranging between 
> 70 and > .90. 

A further analysis of the mean 
errors during Trials 1-12 was possible. 
In a two-choice situation, such as the 
black-white discrimination problem of 
this study, the probability of an error 
is assumed to be .5. The expected 
mean number of errors in 12 trials, 
then, is 6. Using t, the mean errors 
per group was tested against this 
theoretical value. For Group 24/22, 
the mean errors during Trials 1—12 
differed from the expected value at 
beyond the .30 level, but for Groups 
47/4, 47/22, and 47/45 the signifi- 
cance levels were < .O1, < .05, and 
< .05, respectively. The latter 
groups, then, made more errors on the 
first 12 trials than could reasonably be 
expected by chance, whereas Group 
24/22 made no more errors than would 
be expected. 


Discussion 


The results, with respect to trials and 
reinforcements to the criterion, were in 
line with predictions based upon a two 
parameter definition of drive. This was 
not the case with respect to errors to the 
criterion, It indicated, 


was however, 
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that the groups did differ with respect to 
the distribution of errors. The high A, 
groups made significantly more errors on 
Trials 1-12 than would be expected by 
chance, whereas the difference in the low 
Ay group was not significant. The differ- 
ence in mean errors between Trials 1-12 
and Trials 13-24 was also greater for the 
high A, Ss than for the low A, Ss. These 
differences in the pattern of errors can 
be explained by assuming that an in- 
correct habit was dominant at the be- 
ginning of training. It was noted above 
that there was an initial black preference. 
Thus, following Hull’s assumption that 
performance is some multiplicative func- 
tion of D and H (5S), we would expect Ss 
with high drive and dominant incorrect 
habits to commit more errors early in 
training than Ss having equally dominant 
incorrect habits but low drive. Since in 
this study the Ss were randomly assigned 
to the different groups it is a plausible 
assumption that the groups were equated 
with respect to correct and incorrect 
habit tendencies and the observed differ- 
ence in the distribution of errors was due 
to the multiplication of equal Hs by 
unequal Ds. In this case the inequality 
in D could only have been determined 
by the different As. It is concluded 
therefore that the experimental hypothe- 
sis was confirmed and that an adequate 
definition of drive requires the specifi- 
cation of at least two parameters, 4 and 
he. 


Taking into account the parameter h,, several 
articles from the literature bear re-analysis. In 
Finan’s paper (3) learning was studied as a 
function of four levels of drive. Groups were 
trained under 1, 12, 24, and 48 hr. of food 
deprivation. Animals were fed for 1 hr. each 
day and eventually trained and tested at that 
hour. The I-hr. group received food 1 hr. 
earlier on the training day. The 12-hr. group 
was delayed in its feeding on the previous day 
and fed at a time such that 12 hr. intervened 
between the end of feeding and training. The 
24-hr. group was fed at the regular hour on the 
day previous to training and the 48-hr. group 
was not fed at all on the day previous to training, 
thereby having 24 hr. and 48 hr. of deprivation 
previous to training, respectively. Extinction 
was under a uniform 24 hr. of deprivation, The 
major results were that the l-hr. group was 


significantly less resistant to extinction than the 
other groups but none of the other intergroup 
differences was significant. It is clear from the 
results of the present study that all groups were 
under essentially 4, = 24 hr., and that only h 
varied at the time of training. Groups with an 
h of 12 hr., 24 hr., and 48 hr., respectively, should 
not have been different from each other and 
should have been superior to the h = 1 hr. group, 
as was the case. 

In a similar study, Strassburger (8) examined 
learning under 4, 1, 4, 11, 23, and 47 hr. of food 
deprivation. His maintenance schedule was of 
a similar design to the one described above. The 
Ss were kept under 22-hr. deprivation until the 
training day, at which point the schedule was 
interrupted to provide the desired levels of A. 
The 4-hr. group was fed sufficiently early to 
allow 4 hr. to intervene between the end of eating 
and conditioning. The 1-hr. group was simi- 
larly treated, etc. Extinction was under a 
uniform 23-hr. deprivation. Contrary to Finan, 
however, no differences were found even between 
the very low A and the high A groups. In terms 
of the present assumptions A should provide 
differential results as a function of variation 
below about 4 hr. There were, however, a 
number of other factors that may have been 
influential in obscuring any differences that may 
have developed as a function of conditioning 
under 4-hr. deprivation as compared with con- 
ditioning under 4 hr. and more. During adap- 
tation all Ss were under 22-hr. deprivation and 
received 40 pellets through the magazine of the 
Skinner box. If differences due to A were 
minimal, they could have been obscured due 
to the operation of secondary reinforcement, 
secondary drive, or some other irrelevant 
variable operative during extinction. 

In a more recent paper Ramond (7) studied 
the learning of an instrumental response as a 
function of two levels of hunger deprivation and 
also delay of reinforcement. Only the former 
variable is of immediate interest. He too de- 
prived both drive-level groups for 23 out of 24 
hr. but tested one group 22 hr. and the other 4 
hr. after feeding. With a single-tailed test P 
was .08 for the significance of the difference 
between groups. This was considered adequate 
for rejection of the null hypothesis. Under a 
two-tailed hypothesis or with a more rigorous 
criterion of significance (P S .05), however, the 
null hypothesis would have been accepted. It 
was presumably felt that the difference between 
high and low drive and its influence on learning 
was sufficiently well established that a less 
rigorous level of confidence could be accepted. 
In view of the present study this thinking 
becomes questionable. 
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Certain gaps, of course, remain in the 
present formulation. Whether the rela- 
tionship between A and Ay is multipli- 
cative or additive, etc., is still unknown, 
The various quantitative interrelation- 
ships between A and A; and other con- 
structs, such as D, sHpr, and sEr must 
be determined. And finally, it should be 
noted that the importance of A;, or some 
similar factor, has long been assumed, 
although generally neglected in favor of 
h. This is indicated in a recent paper 
(2) in which it is asserted that a 24-hr. 
period of food deprivation has different 
effects on an S which has been on an ad 
libitum diet as compared with an S§ that 
has recently undergone prolonged depri- 
vation. The importance of A; is also 


implied in those studies in which drive 
is specified in terms of weight loss rather 
than hours of deprivation. 


SUMMARY 


Forty-five rats (41 completed the experi- 
ment), divided into five groups, were run in a 
black-white discrimination problem. The five 
groups correspond to different degrees of food 
deprivation as follows: Group 5/4—deprived for 
5 hr. out of every 48 hr. and run 4 hr. after 
feeding; Group 24/22—deprived for 24 hr. out 
of every 48 hr. and run 22 hr. after feeding; 
Group 47/4—deprived for 47 hr. out of every 
48 hr. and run 4 hr. after feeding; Group 47/22 
deprived for 47 hr. out of every 48 hr. and run 
22 hr. after feeding; and Group 47/45—deprived 
for 47 hr. out of every 48 hr. and run 45 hr. after 
feeding. 

It was hypothesized that D is some function 
of two parameters, A (related to hours of depri- 
vation at the time of measurement) and hy, 
(related to total hours of deprivation during a 


unit of time). It was further assumed that A is 
related to stomach distension and that its con- 
tribution to D reaches an asymptote at about 
4 hr. In this case, then, the only relevant 
parameter was A, and it was predicted that 
learning facility would be in the order: (47/4 
= 47/22 = 47/45) > 24/22 > 5/4. The ex- 
perimental hypothesis was confirmed. 

The implications of the parameter h; and the 
necessity of specifying both A, and the usual 
parameter A were shown by reference to the 
literature. 


REFERENCES 


1. Ersman, E., Asimow, A., & Marrzman, I. 
Habit strength as a function of drive in a 
brightness discrimination problem. J 
exp. Psychol., 1956, $2, 58-64. 

. Brown, J.S. Pleasure-seeking behavior and 
the drive-reduction hypothesis. Psychol. 
Rev., 1955, 62, 169-179. 

. Finan, J. L. Quantitative studies in moti 
vation: I. Strength of conditioning in rats 
under varying degrees of hunger. /. 
comp. Psychol., 1940, 29, 119-134. 

. Grice, G. R. The relation of secondary 
reinforcement to delayed reward in visual 
discrimination learning. J. exp. Prychol., 
1948, 38, 1-16. 

. Huw, C. L. Essentials of behavior. 
Haven: Yale Univer. Press, 1951 

. Muyy, N. L. Handbook of prye hological 
reszarch onthe rat. New York: Houghton 
Mifflin, 1950. 

. Ramonp,C. K. Performance in instrumental 
learning as a joint function of delay of 
reinforcement and time of deprivation. 
J. exp. Psychol., 1954, 47, 248-250. 

. Srrasspurcer, R. C. Resistance to extinc- 
tion of a conditioned operant as related to 
drive level at reinforcement. Jj. exp. 
Prychol., 1950, 40, 473-487. 


New 


(Received August 4, 1955) 








Journal of Experimental Psychology 
Vol. 52, No. 2, 1956 


IRRELEVANT OR PARTIALLY CORRELATED STIMULI IN 
DISCRIMINATION LEARNING! 


M. A. JEEVES 


Cambridge University 


In this study the following four 
questions were investigated: What is 
the effect of (a) an irrelevant stimulus 
on discrimination learning, (b) the 
previous irrelevance of a stimulus on 
its subsequent discrimination, (c) a 
partially correlated stimulus on dis- 
crimination learning, and (d) previous 
partial correlation of a stimulus on its 
subsequent discrimination? 

Experiment I dealt with the first 
two questions, and Exp. II with the 
latter two. Both experiments were 
intended as a basis for further evalu- 
ation of the continuity and noncon- 


tinuity theories of discrimination 
learning. 
Experiment | 
Method 
Procedure.—The apparatus (see Fig. 1), 


which has been described elsewhere (2), was a 
four-unit linear maze with a starting box and a 
goal box. In each unit were two aluminum 
gates hinged at the top. Either gate could be 
locked by placing a nail behind it. Before each 
gate was a movable floor plaque. The bars on 
plaques were three-dimensional and painted 
gray. 

The Ss were 32 male albino rats, 110 days old 
at the outset of the study. ‘They were adapted 
to the apparatus by a series of steps ending with 
a limited number of forced choices in which 
gray gates and plaques without bars were used. 
Self-correction of errors (defined as touching a 
gate with the nose) was permitted. At the end 
of the run S was allowed a few seconds to eat a 
wet mash of Purina Laboratory Chow in the 
goal box. Each group was fed once daily for 
15 min. following the session. 

Design.—The 32 rats were divided at random 
into an experimental (Group E) and a control 


!The present study is part of a series com- 
prising the “Cognition Project” under the 
direction of J. S. Bruner of Harvard University. 
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(Group C) group. In Task 1, form discrimi- 
nation was requited (e.g., to approach longi- 
tudinal bars). Group FE learned the form 
discrimination with color irrelevant, i.e., the 
positive form was paired randomly with black 
and white gates. The reason for using color 
rather than form as the irrelevant stimulus was 
that in this way it was certain that the rats were 
exposed to an irrelevant stimulus (black vs. 
white) which is highly distinctive and learnable 
for rats. Group C learned the form discrimi- 
nation with color held constant (both gates 
gray). In each group, longitudinal was positive 
for half the rats, and lateral was positive for the 
other half. Both groups were given 128 trials 
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Fic. 1. Top view of unit of the four-unit 
linear maze. A and A’ are top-hinged aluminum 
gates. B and B’ are movable floor plaques. 


C is a partition, D is an air gap; and E a locking 
nail. Each unit is 28 in. long, 4 in. high, and 
8 in. wide at widest part. Lateral and longi 
tudinal bars are shown as hatched. 


(32 runs) on Task 1, which was sufficient for 
most Ss to attain mastery. 

In Task 2, both groups? learned a simul 
taneous-type black-white discrimination without 
the forms being present. ‘That is to say, gray 
floor plaques without bars were located before 
the gates. In each group, black was positive 
for half the Ss, and white for the other half. 
There were 80 trials. 

The stimulus orders were set up as follows: 
The positive stimulus appeared twice on the 
right and twice on the left in each run (four 
trials), and equally often on each side in each 
maze unit in each block of four runs. Where 
colors were paired with forms, in each run a 


? The two slowest learners in each group were 
dropped at the end of Task | in order to increase 
the sensitivity of Task 2 comparisons. In 
addition, one rat had to be dropped from each 
group early in Task 2 because of “gate shyness,” 
thus making 13 Ss in each group. 
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given color was paired twice with each form. 
Within these restrictions, orders were random. 


Results 


Groups E and C, according to Fig. 
2, learned form discrimination (Task 
1) at about the same rate. Group E 
had a mean of 26.75 errors (SD 
= 9.97), while Group C had a mean 
of 31.69 (SD = 14.16). The differ- 
ence between these means is not 
significant (t = 1.10). 

Turning to Task 2, Groups E and 
C (see Fig. 3) learned color discrimi- 
nation at about equal rates. Group 
E had a mean of 15.08 errors (SD 
= 3.99), while Group C had a mean 
of 12.92 (SD = 4.16). The difference 
in means is not significant (t = 1.30). 


Although the samples were small, 
there was little intragroup variability, 
so that a difference in means of only 
3.43 would have been significant at 
the 5% level. 


ExpeRImMeENT II 
Method 


The apparatus, methods of pretraining, and 
details of procedure were the same as in Exp. I. 
The Ss were 31 albino rats which were 110 days 
old at the outset of the study. 

In Task | form was the critical variable. One 
color was paired with the positive form on 75% 
of the trials and with the negative form on the 
remaining 25% of the trials. For example, 
black would be the “75% color” if it were paired 
with the positive form on 75% of the trials, 
while white would be the “25% color,” since it 
would be paired with the positive color on only 
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Fic. 3. Errors on Task 2 color discrimination. 


25% of the trials. In Task 1 all 
given 128 trials (the same as in Exp. I.) 


31 rats were 
This 
amount of practice was sufficient for an average 
of 95% correct responses on the last 16 trials of 
training 

In Task 2 color (black vs 
critical variable, and form was held constant 
(longitudinal and were absent). 
The 31 rats were divided at random into two 
groups. “Group 75” was reinforced for re- 
sponding to the 75% color, i.e 


white) was the 


lateral bars 


. the color whic h 
had been more often positive in Task 1, while 
“Group 25” was reinforced for responding to the 
25% color (which had been more often negative 
in Task 1). The two groups were practically 
equivalent in the rate at which they had learned 
the previous discrimination. In group, 
black had been the more frequently positive 
color for half the rats, and white for the other 
half. 


Group 75 and three rats from Group 25 because 


eac h 


It was necessary to drop one rat from 


of “gate shyness,” leaving 16 and 13 Ss, respec 
tively. ‘There were 80 trials. 

Regarding stimulus orders, in Task 1 one 
color was paired three times with the positive 
form and once with the negative form in each 
run (four trials). In each set of four runs each 
color appeared equally often on the right and on 
the left. 
of Exp. I, 


were random. 


Other controls were similar to those 
and within these restrictions orders 


Results 


In Task 1 the mean of the errors 
was 25.85 (SD = 10.46). This mean 
was not significantly different from 
that of Group E (26.75) or that of 
Group C (31.69) in Exp. I. 

In Task 1 was a greater 
tendency to approach the negative 
form when it was paired with the 
75% color than when it was paired 
with the 25% color (which 


there 


was 
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negative on 75% of the trials). Of 
the 31 rats, 22 made more errors when 
the 75% color was paired with the 
negative stimulus than when the 25% 
stimulus was so paired; only four 
showed a difference in the opposite 
direction; and five showed no differ- 
ence. A test of the hypothesis of 
equal frequency (13 in each direction) 
for the unambiguous cases yielded a 
chi square of 12.46 (P < Ol). 

In Task 2, there was an initial 
tendency to approach the color which 
had been more often positive in Task 
1. In the first eight trials, 12 of the 
13 rats in Group 25 more often ap- 
proached the 75% color than the 25% 
color, and one rat showed no differ- 
ence. Of the 16 rats in Group 75, 11 
responded more often to the 75% 
color in these initial eight trials, two 
to the 25% color, and three showed no 
difference. Combining the groups, 
23 Ss more often responded to the 
75% color and only two to the 25% 
color. A test of the hypothesis of 
equal frequencies yielded a chi square 
of 17.64 (P < 1). In these initial 
eight trials, the mean percentage of 
responses to the 75% color was 70% in 
Group 75 and 74% in Group 25. 

Figure 4 shows error curves for 
Groups 75 and 25 in Task 2. Group 
75 had a mean of only 9.81 errors 
(SD = 4.86), while Group 25 had a 
mean of 27.15 errors (SD = 5.79). 
The difference in means is significant 
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Fic. 4. Errors in Task 2 color discrimination. 


beyond the 1% level (t = 8.50). It 
should be noted that the Task 2 mean 
for Group E (15.08), in which color 
had been irrelevant in Task 1, fell 
between the corresponding means for 
Groups 75 and 25. The difference 
between the means of Group 75 and 
Group E is significant (t = 3.23, 
P < Ol), as was the difference be- 
tween the means of Group 25 and 
Group E (t = 6.66, P < 01). 


Discussion 


Let us evaluate the continuity and 
noncontinuity theories of discrimination 
learning in regard to these results. 
Continuity theory postulates successive 
increments in the excitatory or inhibitory 
potential of a stimulus as a result of 
reinforcement and nonreinforcement, re- 
spectively. Presumably the components 
of a stimulus complex severally acquire 
response __ tendencies. Noncontinuity 
theory postulates an hypothesis-forming 
and hypothesis-testing process. Pre- 
sumably each well-defined component of 
a stimulus complex would be eligible for 
a potential hypothesis. In the main, 
noncontinuity theorists such as Lashley 
(4) prefer to hold that only one hypothe- 
sis at a time is entertained, and that 
during such a period no learning occurs 
with respect to other components of the 
stimulus complex. 

In Exp. I, Group FE learned Task 1 as 
rapidly as did Group C, which shows that 
an irrelevant stimulus does not neces- 
sarily interfere with learning to dis- 
criminate a critical stimulus. A similar 
result was obtained by Elam and Bit- 
terman (3). This finding is consistent 
with continuity theory. Since, on the 
average, responses to the two colors were 
equally often reinforced (and nonrein- 
forced) throughout training, response 
tendencies to colors would have been 
balanced. Therefore, differential tend- 
encies would have developed only with 
respect to form stimuli. Hence Groups 
E and C would have learned at about 
equal rates, which was the outcome. 

But this finding is difficult to reconcile 
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with noncontinuity theory. Having had 
more dimensions of stimulus variation, 
Group E would have had more potential 
hypotheses (some erroneous) to form and 
test than would Group C (e.g., color and 
even joint color-form hypotheses). 
Whether only one hypothesis at a time 
were entertained, or several concurrently 
(a real feat), Group E on the average 
ought to have learned less rapidly then 
Group C, which was contrary to fact. 

Turning to Task 2 of Exp. I, the fact 
that Groups E and C learned at about 
equal rates shows that the previous 
irrelevance of a _ stimulus does not 
necessarily affect its subsequent dis- 
crimination (when the previously critical 
stimulus is absent). This result is con- 
sistent with continuity theory. Since 
response tendencies to colors would have 
been equated on the average in both 
groups at the outset of Task 2, the two 
groups would acquire color discrimi- 
nation at about equal rates. This 
finding poses a problem for noncon- 
tinuity theory, for it would appear that 
the previous irrelevance of a stimulus 
ought to render less probable an hy- 
pothesis about its subsequent relevance, 
and thus retard learning. 

Regarding Exp. II, the fact that in 
Task 1 the Ss were more likely to ap- 
proach the negative form when paired 
with the 75% color than when paired 
with the 25% color follows directly from 
continuity theory. Responses having 
been more often reinforced (and less 
often nonreinforced) to one color than 
to the other, response tendencies to the 
75% color would have been stronger than 
those to the 25% color. It is difficult to 
see how the one-hypothesis-at-a-time 
version of noncontinuity theory could 
account for this finding, especially after 
Ss began responding at better than 
chance to form. The result may be 
reconciled with a more complex version 
which permits of multiple concurrent 
hypotheses at different orders of proba- 
bility—an all-or-none hypothesis about 
form and a probabilistic hypothesis about 
color. 

Turning to Task 2 of Exp. II, Ss 
initially preferred the 75% color, and 


Group 75 learned more rapidly than 
Group 25, findings similar to those of 
Bitterman and Coate (1). Both findings 
follow directly from continuity theory, 
since, as explained previously, differential 
response tendencies to colors would have 
existed at the outset. A continuity view 
would attach no special significance to 
the fact that the mean initial percentage 
of responses to the 75% color was itself 
close to 75%. Since in Task 1 each 
alternative was only partially reinforced 
(and nonreinforced), habit strengths with 
respect to colors would have differ- 
entiated slowly. With more extended 
training on Task 1, the initial per 
centage of responses to the 75% color 
might have been higher in Task 2. 

The foregoing results are difficult to 
reconcile with the version of noncon- 
tinuity theory which restricts learning 
to one hypothesis at a time. For if Ss 
had emerged with a single form hypothe- 
sis in Task 1, what would account for the 
transfer to Task 2 of response tendencies 
to colors? If, however, noncontinuity 


theory were extended to allow not only 
multiple concurrent hypotheses, but also 
hypotheses at different orders of proba- 


bility, then the initial tendency to 
approach the 75% color on nearly three- 
fourths of the trials would follow, and so 
would the advantage of Group 75 in 
learning Task 2. Such an elaboration of 
noncontinuity theory is speculative and 
requires much more evidence in its favor 

As a final comment, the results in Task 
1 of Exp. 1 have implications for the 
perceptual nature of the stimuli. The 
stimulus materials appeared to function 
primarily as color and form elements (four 
in number). Such an assumption was 
employed in applying continuity theory 
to account for the fact that Groups FE. and 
C learned at about equal rates. If forms 
and colors had functioned primarily as 
color-form compounds, Group FE, having 
had four such compounds, would have 
learned less rapidly than Group C, 
having had only two such compounds. 
If the colors and forms had functioned 
primarily as “transverse patterns,” then 
Group E, having had four such patterns, 
would have learned less rapidly than 





94 M. A. JEEVES AND A. J. NORTH 


Group C, which had only two patterns. 
Although learning might well have 
occurred on all three levels of stimulus 
complexity, in view of the outcome 
probably form and color elements were 
the dominant stimulus units. 


SUMMARY 


Two experiments were undertaken to study 
discrimination of stimuli when one component 
is critical and the other is either irrelevant or 
partially correlated. In Exp. | Group E learned 
a form discrimination with color varying ran- 
domly, whereas Group C learned the form 
discrimination with color held constant. Both 
groups learned at about equal rates. ‘Then each 
group learned the color discrimination (with 
forms absent). Again there was no difference. 
An irrelevant stimulus, therefore, does not 
necessarily interfere with learning to discrimi- 
nate a critical stimulus, nor does its previous 
irrelevance subsequently affect its discrimination 
as a critical stimulus (the earlier critical stimulus 
being absent). 

In Exp. II all rats learned the form discrimi 
nation while one color (the “75% color’) was 
paired on 75% of the trials with the positive 
form. More errors occurred on trials in which 
75% color was paired with the negative form 
than when the 25% color was paired with this 
form. ‘Then one group learned to approach the 
75% color while the other learned to approach 


the 25% color. Both groups showed an initial 
tendency to approach the 75% color, and Group 
75 learned more rapidly than Group 25. Partial 
correlation of a stimulus, therefore does not 
necessarily interfere with or enhance learning to 
discriminate a critical stimulus, and the subse- 
quent discrimination of such a stimulus depends 
on whether it previously had been more often 
positive or negative. 

The implications of the findings for the 
continuity and noncontinuity theories of dis- 
crimination learning were stated. The con- 
tinuity theory is favored by these data. 
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THE EFFECT OF THE INITIAL REINFORCEMENT ON 
RESPONSE TENDENCY 
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It is possible to reject a drive 
reduction theory of reinforcement 
while adhering to a position in which 
food for a hungry animal brings about 
the learning of an instrumental re- 
sponse (2). When we take this stand, 
there are three alternative expla- 
nations as to the nature of rein- 
forcement. 

One possibility is that the presence 
of a goal object (or its surrogate) 
confirms (or enhances) an organism’s 
expectancy for this particular goal 
object. Such an interpretation has 
been suggested by White (7). Ac- 
cording to this position, no instru- 
mental-reward learning can occur 
until an expectancy for the goal object 


in the situation has first been estab- 


lished. In Hullian terms this view- 
point means that the translation of a 
fractional consummatory response 
(ra) into a more intense rg or the 
consummatory response (Rg) con- 
stitutes the mechanism of reinforce- 
ment. 

The second possibility is simply 
that the eliciting of a consummatory 
response or its rq strengthens a prior 
instrumental act, as seems to be 
implied by Sheffield and his associates 
(5, 6). 

Both of these positions are in the 
“law of effect” tradition in that they 
imply that the reinforcing state of 
affairs strengthens the response tend- 
ency of a preceding instrumental 
response. The third possibility, on 
the other hand, does not involve the 

!'The experiment was conducted by R. L. 


Martindale as a requirement for the M.A. 
degree, 1950, Michigan State College. 
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strengthening of a response which 
occurs prior to the reinforcing state 
of affairs. Reinforcement or the 
strengthening of any response tend- 
ency occurs (simultaneously) with 
the elicitation of the response in 
question. For example, the approach 
response characteristically elicited by 
food in a goal box is conditioned to 
contiguous stimuli; thus the rat learns 
to approach the correct side of a T 
maze (2). This third alternative is a 
contiguity notion in which a goal 
object is important to learning in 
terms of the kind of it 
consistently elicits. 

The present research was directed 
at differentiating between the first 
alternative and the last two, although 
the data appear to shed some light on 
distinguishing between the second and 
third alternatives as well. Thus, we 
need to answer the question, do naive 
animals show any learning on the very 
first rewarded trial. It might seem 
that this point could be determined 
by an examination of the literature 
But preliminary training procedures 
as well as other methodological con- 
siderations obscure an adequate ap- 
praisal of this question. In order to 
test our hypothesis that the very first 
presentation of the goal object can 
contribute to the final response tend- 
ency, it is necessary to set up con- 
ditions for sensitive measurement of 
the effects of the first trial 
differential response situation. 

Using a T maze and reversing the 
direction of the correct response after 
the first rewarded trial, Moore (4) 
made a first attempt at such a test. 


response 


in a 
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The results, though clear, were con- 
founded. The first reward contrib- 
uted nothing while the second reward 
produced a significant increment to 
the response tendency. However, the 
goal object consisted of a single, small, 
familiar pellet of food in a glass 
coaster, and it was not uncommon for 
a naive rat on the first rewarded trial 
to take a minute or so to consume it. 
Since the animals initially had dif- 
ficulty in finding the food and, 
therefore, no immediate “secondary 
reinforcement” through sight of the 
food, the experiment could not be 
considered an adequate test of the 
hypothesis. 

In the present experiment special 
care was taken to insure immediate 
“secondary reinforcement” of the 
correct response. Also, the hypothe- 


sis was further evaluated by com- 
paring the effect of the initial reward 
on naive animals and on animals 
which already had a food association 


established to the maze situation. If 
control of the variable of food expect- 
ancy is important, then the sophisti- 
cated animals should show more 
learning with the initial reinforcement 
than the naive. 


Main Experiment 
Method 


Subjects. —The Ss were 105 experimentally 
naive, male and female albino rats from the 
colony of the psychological laboratory of 
Michigan State University. Age varied from 
@& to 100 days; the range and median age were 
practically identical for all conditions. Mean 
age was approximately 110 days at the begin- 
ning of the experiment. 

Apparatus.—The enclosed T maze (Fig. 1) 
was designed to make the delay of reward as 
short as possible, as the response at the choice 
point was a continuous turn ending in the end 
box. The curved surfaces were constructed of 
heavy-gauge galvanized metal while the re- 
mainder was constructed of wood. All walls 
were 8 in. high. All the interior surfaces except 
the end boxes were painted a neutral gray. 





Seccceca 

















Fic. 1. Floor plan of the apparatus. 


Except for the starting box, which had an 
opaque cover, the maze was covered with 
hardware cloth. ‘The doors at the choice point 
were operated swiftly by a special release 
mechanism. When S’s nose crossed the response 
criterion line, the choice-point door and the door 
farther from the selected end box closed simul- 
taneously, preventing any retracing or change of 
response. As soon as S had completely entered 
an end box, the door nearest the selected end 
box was also closed. The starting box had a 
conventional pull door, and all doors were 
padded at the bottom with sponge rubber. 
There were two forcing blocks the same height 
and color as the rest of the maze which could be 
used to block off either alley at the choice point. 
The end boxes differed in shape as illustrated in 
Fig. 1. The negative end box was lined on the 
floor and sides with white posterboard in such a 
way a3 to leave a }-in. space between the poster- 
board and the wooden walls of the box. In 
order to control for food odor, this space was 
filled with crushed food identical to that used 
in the positive goal box. The positive goal box 
was lined with black posterboard and the entire 
floor was covered with Purina Checkers of the 
same type as Ss were normally fed. Poster- 
board was used rather than paint to control for 
any differential paint odor. A large rectangular 
goal box was used in preliminary training with 





INITIAL REINFORCEMENT 97 


certain groups. It was painted a neutral gray, 
covered with hardware cloth, and placed at the 
end of the straight alley. 

The maze, in general, was lighted by a 200-w. 
bulb placed directly over the center of the 
straight alley, 68 in. from the alley floor. A 
&)-w. bulb with reflector was placed 4 in. from 
the top of the maze directly overeach arm. Any 
illumination difference visible from the choice 
point was barely, if at all, discernible to £ 
All Ss 
were given two days of habituation consisting 
of handling by EZ and running free on a small 


Habituation and preliminary training. 


table. On the first day all Ss were given an 
unlimited amount of food in their home cage and 
on the second day were fed nothing. Water was 
available throughout the entire experiment. A 
5-day preliminary training period followed in 
which the Ss were divided equally into a Food 
Expectancy (F) group and a No Food Expec- 
tancy (NF) group 
briefly by E and allowed to run free on a small 
table. These Ss were fed 7 g. of food the first 


day and 4} g. on the remaining days 


The NF group was handled 


of pre 


liminary training. The Ss of the F group were 


placed in the starting box and allowed to run 
down the straightway to the preliminary training 
goal box which was littered with food, and each 
S was The S 
received two massed trials per day and was fed 
64 g. the first day and 4 g. 
four 


deprivation level prevailed for both groups by 


allowed 30 sex eating time. 


on each of the re 


maining Obviously, a very high 


the end of this period The Ss in both groups 


were fed dual cages 10 min. after the 
end of the di 
Training 


ninary tramimng grouy 


was divided into three training conditions: 
Control (C), Experimenta!-1 (E-1), and Experi 
mental-2 (F-2 

designated NF-( 
and FE-2. The trained 
consistently to one side of the T maze; for the 


position of the 


1 his waveat tal of Six groups 
NFE-1, NFE-2, F-C, FE-1, 


control Ss were all 


E-1 groups the positive and 


negative end boxes was reversed after one free 
and one forced which only one was 


correct and rewarded; for the E-2 groups the 
position of the end boxes was reversed after two 
free and two forced trials of which only two were 
correct and rewarded 

The C and E 


initial position preference 
With 16 Ss in each subgroup 


] groups were equated for sex, 
and position of the 
positive end box 
this gave two Ss under each possible combination 
of con litions 

NFE-2 and FE-2 were equated for position 
variables 


preference only, but the other two 


were equated as much as possible. There were 
14 Ss in each of these groups. Because of the 


equation of position preference on the basis of 


the first training trial most Ss were not assigned 
until the first trial. This 
technique made possible the beginning of all 


to a group after 
groups at the point of 50% correct response. 
During training, S was given four trials per 
day for four days. Two of each day's trials 
were to the negative end box and two were to 
the positive box, thus each S had two reinforced 
and two nonreinforced The 
trials of thie first day were alternately free and 
forced starting with a free trial 


trials per day? 


On succeeding 
days the first two were free and the second two 
forced, with only these combinations possible 
LLRR, RRLL, RLLR, and LRRL 


rewarded with a w sec. cating period in the goal 


The S was 


box. On trials to the positive side S was timed 
from the moment its nose touched the criterion 
The S remained in 
The 
intertrial interval was approximately 30 min 
All Ss were fed individually 10 min. after the last 
trial of the day 


line until it began to eat 


the negative end box 0 sec. after entering 


They were given 4 g. on the 
first two days and 54 g. on the last two days 
Thirteen disc arded Ss were replac ed with like 
animals giving a net N of 92 Ss Animals were 
discarded for any one of the following reasons 
refusal to eat during preliminary training on all 
trials of the first four days or on either trial of 
the fifth day, refusal to eat within 5 min. on the 
first trial of the training session, and refusal to 


! f 


eat on both rewarded trials « any subse juent 


day of training 


Results 


The main measures employed were 
the number of correct responses on the 
free trials and the mean latency to eat 


after S touched the response « riterion 


W herey er 


line. analysis of Variance 


was used, the requirement for homo- 


geneity was satisfied. An analysis 


2 Trials to the negative side were necessary 


in order to ext loratory tendencies 


W hen the first trial to the negative & le followed 


equate 


a rewarded trial, it 18 vable that nonreimn 


conce 

forcement of the incorrect response contributed 
to make a correct res 
the second free trial. ‘This is unlikely, however, 
The third-order 
sition preference X sex K position of 
y ie| Is on I less | 


able evidence (1) 


to the tendency ponse on 


for various reasons inter 


action (f 
end box) than 1, and the avai 


indicates that nonreinforce 


ment of the wrong response influences the 


learn ng of the correct response only alter learn 
ing has progressed a few trials and then only 
when the tendency to make a wrong response is 


fairly strong 
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TABLE 1 


Number oy Correct Responses ror 
Totat Traminc Person 


Groupe N 


Control (C) 


Mean) SD | Difference 


—— 


32| 7.12 | 1, 
|32|6.25| .87| C-E-2 
28 | 5.65 | 1.14) E-l—E-2 


of variance of the number of correct 
responses for the total training period 
indicates that there is no significant 
difference between preliminary train- 
ing conditions (F < 1, df = 1 and 44) 
and a highly significant difference 
between training conditions (F 
= 12.48, df= 2 and 44, P < Ol). 
Chi square for the largest single 
difference between preliminary train- 
ing conditions on any one free trial, 
Trial 2, is also insignificant (x? = 1.3, 
P = 30). This is true even though 
the preliminary training variable was 
effective in producing significantly 
different latencies to eat, as will be 
shown later. Consequently, our hy- 
pothesis has received initial support 
and the preliminary training con- 
ditions were combined for the re- 
maining analyses of learning. The 
mean number of correct responses for 
the combined groups for the total 
training period is presented in Table 
1, together with P values. 


Besssasss 








PER CENT CORRECT RESPONSE 


FREE TRIALS 


Learning curves of the three training 
preliminary training conditions 


Fia, 2. 
groups, 
combined, 


The learning curves for the three 
training conditions are presented in 
Fig. 2. As apparent in Fig. 2, the 
effect of the initial reward is manifest 
on the second free trial. This incre- 
ment is highly significant. On the 
second free trial 67 of 92 Ss repeated 
the response that was correct on the 
first rewarded trial (x? = 9.18, P 
= 1). The data clearly indicate 
that learning occurred on the basis 
of the initial reinforcement. A more 
complete analysis of the effect of the 
second reward would have been 
important if the initial reward had 
had no effect. However, since the 
E-2 data show that the second reward 
had an additional effect, they do 
provide further confirmation of our 
hypothesis. 

Figure 3 shows the mean latency 
of the eating response on each of the 
training days for the two preliminary 
training conditions. In the analysis 
of variance the assumption of homo- 
geneity was satisfied by using the 


60) 








2 3 
TRAINING DAYS 


Fic. 3. Latency of the eating response 
during training for the two preliminary training 
conditions. 
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mean latency per day per animal 
rather than the mean latency over the 
total training period. The residual 
variance due te days was removed 
from the error term. The sophisti- 
cated F groups ate significantly sooner 
than the naive NF groups (F = 6.31, 
df = 1 and 86, P = .02), but there is 
no significant difference in eating 
latency between training conditions 
(F <1, df =2 and 86). The im- 
portance of this finding lies in the 
fact that this time difference does not 
appear to affect the speed of learning. 
Essentially the same number of Ss 
above (34) and below (33) the median 
eating latency of 48.5 sec. on the first 
trial went “correctly” on the next free 
trial. 
Ss the rank correlation coefficient 
between mean latency and number of 
correct responses for the total training 
period is .20 and for the naive controls 
is — .02. Both Ps fall far short of 
significance. 


For the sophisticated control 


There exists a possible flaw in the 
experiment with respect to our inter- 
pretation of the learning in terms of 
elicited or reinforced response. It may 
be that the making of an approach 
response to the cues which immediately 
preceded reward was not an essential 
condition for the learning that took place 
on the first recorded trial. The learning 
may been purely perceptual or 
cognitive, Ala Tolman. For, due to the 
structure of the apparatus, when S was 
eating in the positive end box or alley 
many of the same stimuli, which were 
also visible from the choice point, were 
associated with eating and may have 
acquired approach or expectancy value. 
Thus the possibility remains that 8 
responded correctly on the second free 
trial on the basis of the expectancy value 
of these cues rather than on the basis of 
having made a rewarded turn response. 
A subexperiment was designed to test 
this possibility. 


ha ve 


SUBEXPERIMENT 
Method 


The Ss were 12 male and 12 female albino 
rats, varying in age from 80 to 105 days. 
Through preliminary training, the procedure 
was exactly the same as for the naive groups of 
the main experiment. The training period con- 
sisted of a single day. The S was placed in the 
starting box and allowed to run down the stem 
to the closed choice-point door. Then 
lifted out of the stem by E and placed in either 
the positive or the negative end box. On this 
first trial, 12 Ss, 6 male and 6 female, were 
placed individually in the positive box with all 
doors closed. For 3 females the 
positive box was on the right and for the other 
6 it was on the left. The S was allowed to eat 
in the goal box for 30 sec. as in the main experi 
ment. Approximately 30 min. later each S was 
given a similar trial except this time S was 
placed in the negative end box for a period of 30 
sec. without food, Exactly the same procedure 
was followed with the other 12 Ss except that 
they were placed in the negative box first and 
the goal box second. Approximately 30 min. 
after the last training trial each S was given one 
free test trial in which the end boxes were in the 
same position as they had been during training 


S was 


3 males and 


Re sults 


Ten of the 24 Ss went in the direc- 
tion of the positive end box on the free 
trial. This proportion of correct re- 
sponses is significantly smaller than 
67 out of 92 on the comparable trial 
of the main experiment (x? = 43.3, 
P = 0O1). Therefore, there is no 
evidence that Ss learned the location 
of the goal box when the cues of the 
choice-point region were associated 
with food and eating on the first trial 
to the positive side in much the same 
way as occurred in the main experi- 
ment after S had arrived at the goal 
box. The conclusion of the main 
experiment need not be qualified. 


Discussion 


Since the present results support the 
hypothesis that the learning of an 
instrumental response can occur on the 
very first rewarded trial, they refute the 
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hypothesis that the reinforcement of an 
instrumental response is dependent upon 
the confirmation of a previously estab- 
lished expectancy. 

Coupled with the fact that Moore (4), 
who used a single, small reward pellet, 
found no evidence of an increment from 
the first reward, the present results 
attest to the regular operation of some 
such principle as immediate secondary 
reinforcement in instrumental-reward 


learning. There is no basis in either 
study to claim the operation of such a 
principle as primary reinforcement. 
Without immediate “secondary rein- 


forcement” there is no learning, and when 
there is learning, the latency of the eating 
response is long and unrelated to the 
speed of learning. Moreover, the large 
initial increment and the very rapid 
learning observed in the present experi- 
ment (essentially a 100% level of correct 
response after only five rewarded trials 
in all groups) attest to the immediacy 
and effectiveness of the 
reinforcement.” 

Also, the results of this study support 
a shift in reinforcement theory. For the 
results seem to offer an inkling as to the 
nature of reinforcement beyond what is 
indicated by the confirmation of the 
present hypothesis. Since the animals 
learn to eat in the goal box no faster than 
they learn the turn response, there is the 
suggestion here that the essential prop- 
erty of “secondary reinforcement,” or 
reinforcement per se is an immediate 
approach to the food. In this way, in- 
strumental-reward learning can be re- 
garded as an example of classical con- 
ditioning. The unconditioned response 
is the response of approaching the goal 
object, and successive elements of this 
approach pattern become consistently 
and discriminatively conditioned to the 
cue stimuli of the maze (2). 


“secondary 


SUMMARY 


In the main experiment, 92 rats served as Ss 
in a simple T-maze learning situation in which 
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the turn response was assured of immediate 
“secondary reinforcement.” Half of the Ss had 
a prior food association established to part of 
the maze and half were naive. All Ss were given 
four trials a day for four days, two free and two 
forced, two reinforced and two nonreinforced. 
Position preference and the position of the 
positive end box were equated. On the second 
free trial there was a significant increment to 
the response tendency of the response which had 
been given one initial reward. There was no 
significant difference in the learning of the naive 
and sophisticated groups even though the latter 
ate significantly sooner than the former upon 
arrival at the goal box where the floor was 
littered with familiar food. 

In a subexperiment 24 rats which had one 
trial to both the positive and negative sides, 
but were carried there instead of running there, 
did not learn. 

The relevance of the results to reinforcement 
theory was discussed. 


REFERENCES 


1. Denny, M. R., & Dunnam, M. D. The 
effect of differential nonreinforcement of 
the incorrect response on the learning of 
the correct response in the simple T-maze. 


J. exp. Psychol., 1951, 41, 382-389. 


2. Denny, M. R., & Apetman, H. M.  Elici- 
tation theory: I. An analysis of two 
typical learning situations. Prychol. 
Rev., 1955, 62, 290-296. 

3. Hui, C. L.° Principles of behavior. New 


York: D. Appleton-Century, 1943. 

4. Moore, D. N. An experimental study of 
the effect of position reversal after one or 
two reinforcements on simple T-maze 


learning in the rat. Unpublished mas- 


ter’s thesis, Michigan State College, 
1949, 

5. Suerrievp, F., & Rosy, T. Reward value 
of a non-nutritive sweet taste. J. comp. 
physiol. Psychol., 1950, 43, 471-481 

6. Suerrierp, F., Wuurr, |., & Backer, R. 


Reward value of copulation without sex 
drive reduction : comp. physiol. Psy- 
chol., 1951, 44, 3-9. 

7. Wuire, R. K. The completion hypothesis 
and reinforcement. Psychol. Rev., 1936, 


43, 396-404. 


(Received August 25, 1955) 








Journal of Experimental Psychology 
Vol. 52, No. 2, 1956 


THE ROLE OF EXPECTANCY IN DELAYED 
REINFORCEMENT! 


ALVIN R. MAHRER 


Ohio State University * 


To a large extent, human behavior 
is characterized by apparent choices 
between immediate and delayed re- 
wards and punishments. Reviews of 
animal learning studies (4, 5, 8) offer 
two findings: (a) immediate rewards 
are preferred over delayed rewards, 
and (b) the effectiveness of a reward 
diminishes with 


or a punishment 


increasing delay. These conclusions 
are supported by the comparatively 
little research on delayed rewards in 
human behavior. These studies dealt 
with performance on verbal mazes 
(14), learning (1, 12), 
solution of conflict situations (2, 3, 9, 
10), and selection of object prefer- 
ences (6, 7). 

The findings of 
research have 
for extensive 


classroom 


delayed reward 
provided the 
clinical applications. 
Mowrer and Ullman (11) for example, 
considered the delay of rewards and 
punishments to be a critical factor in 
the development of 
criminal 


basis 


neurotic and 
Shaw has incor- 
porated the idea of delayed reward 
into proposed postulates of 
He said (15, p. 426): 


“The problems dealt with in psycho- 


behavior. 


some 
psychotherapy. 


therapy are in the nature of behavior 


which has immediately rewarding 


consequences, but more remote pun- 
ishing consequences.” 

The proposed theoretical explana- 
tions of the two conclusions to delayed 


! This paper is based on a dissertation sub 
mitted in partial fulfillment of the requirements 
for the Ph.D. degree in Psychology at the Ohio 
State University. The author is 
grateful to Dr. Julian Rotter. 

2 Now at Fitzsimons Army Hospital. 


especially 


reward research have focused on the 
value of the reward. 

They state, in effect, that a delay 
of a reward results in a decline of the 
value of that With a 
creased value, learning will be less 
effective and immediate rewards will 


reward. de- 


be chosen over delayed rewards 

The present investigation was car- 
ried within the 
Rotter’s social learning theory 


out framework of 


(13). 
This theory distinguishes between the 
value of a reward or reinforcement, 
and the expectancy for occurrence of 
the reinforcement. 
each of follows: 
The value of any 
external reinforcement may be ideally 
defined as the degree of preference for 
any occur if the 
possibilities of their occurring were 


The definition for 


these two terms 


reinforcement 


reinforcement to 
all equal. Expectancy may be 
defined as the probability held by the 
individual that a particular external 
reinforcement will occur as a function 


of a specific behavior on his part in a 


(13, 


specific situation or situations” 
p. 107). 

In contrast to previous explana- 
tions, which focused on an hypothe- 
sized decline in the value of a delayed 
the purpose of the 
investigation is to introduce 


reward, present 

(Xx pec- 
tancy as a factor in delayed reinforce- 
ment Since are 
several meanings of “‘delayed rein- 
forcement” 


situations. there 
and “delayed reinforce- 
should be 
emphasized that this study is con- 
cerned with 


ment situations,” it 


human behavior in a 


situation involving an apparent ¢ hoice 
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between immediate 
reinforcements. 

The present study hypothesized 
that in the course of previous ex- 
periences with delayed reinforcements, 
an individual will develop some level 
of probability (expectancy) that de- 
layed reinforcements will occur. The 
level of the probability is presumed to 
be an important factor in the prefer- 
ence strength for the immediate or 
delayed reinforcement. Rotter (13) 
suggested that a decrease in the 
expectancy of occurrence of delayed 
reinforcements will result in a decrease 
in the strength of the preference for 
these reinforcements. The first pur- 
pose of this study was to develop 
varying levels of expectancy of occur- 
rence of delayed reinforcements, in 
order to determine whether or not 
corresponding changes in the strength 
of the preference for delayed rein- 
forcements, as compared with im- 
mediate reinforcements, would occur. 

The second purpose was to investi- 
gate the extent to which training by 
one E generalized to a second EE. It 
was hypothesized that social agents, 
perceived as instrumental to the 
occurrence or nonoccurrence of the 
delayed reinforcements, served as one 
of the important cues for the level of 
the expectancy. Different social 
agents would then be presumed to 
serve as cues for different levels of 
expectancy. 


and delayed 


Hypotheses 


When £, offers the reinforcements 
there will be a regular decline in the 
choice of delayed reinforcements for 
high, moderate, and low expectancy 
groups respectively. Also, the high 
and low expectancy groups will have 
a respectively higher and lower choice 
of delayed reinforcements than will 
their respective control groups. In 
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order to test the generalization of the 
effects of training, these hypotheses 
also apply to the situation where Ex, 
offers the reinforcements. It is ex- 
pected that there will be significant 
differences between E,’s subgroups 
and those of Ex. 


MetTuop 


Subjects.—Forty-five Ss 
preliminary research. ‘There were 137 experi- 
mental and 97 control Ss. These Ss were 
matched, as far as feasible, in variables which 
appeared to be correlated with both the values 
of the reinforcements and the expectancy for 
occurrence of delayed reinforcements. These 
variables were: sex, age, grade, and socio- 
economic status. All Ss were males, 7 to 9, in 
Grades 2 and 3 of public schools located in a 
lower socioeconomic area of Columbus, Ohio. 

Preliminary study.—A preliminary study was 
used to select two reinforcements which so 
differed in reinforcement value that approxi- 
mately one-half of a group would choose the 
reinforcement of lower value when it was to be 
given to them immediately, and one-half of the 
group would choose the reinforcement of higher 
value when it was to be given to them a day 
later. With this pair of reinforcements, meas- 
urements could be made over a wide range of 
increases or decreases in the choice of rein- 
forcements. 

To select this pair, a measurement of the 
reinforcement value of a group of 11 toys was 
obtained by a rank-preference method. By 
progressively eliminating combinations of toys, 
a pair emerged in which half of the group chose 
the immediate reinforcement of lesser value, and 
half chose the delayed reinforcement of higher 
value. The immediate reinforcement was a 
small metal airplane, and the delayed reinforce- 
ment was a small, plastic “flying saucer.” As 
a reliability check on the reinforcement values 
of these two toys, 25 additional Ss chose the 
fiying saucer (delayed choice) in preference to 
the airplane. 

Experimental groups.—Three experimental 
groups were trained at three different schools, 
to prevent any effects of communication. To 
develop three levels of expectancy for the occur- 
rence of delayed reinforcements, a series of five 
training trials was used. Following a pre- 
liminary activity, each S was told that he would 
receive a small balloon on the following day. 
This procedure was followed for five consecutive 
days, so that there was a total of four possible 
reinforcements. ‘To develop the high, moderate, 


were used in a 
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and low expectancy for delayed reinforcements, 
four, two, and none of the promised reinforce- 
ments were received, respectively. 

Three days following the training trials, each 
S was offered 
delayed 


his choice of an immediate or 
reinforcement. In this, each 
experimental group was divided into two sub 
groups 


doing 


One of these subgroups was given the 
choice by £4, who was the £ during the training 
trials; the other subgroup was given the choice 
by Ep, who had no prior contact with the groups. 
To maximize the differences between FE, and 
Ey, (a) the Es differed in age and sex; (b) three 
days intervened between training and the choice 
situation; (c) no mention of the choice situation 
was made during the training; and (d) in the 
choice situation the two Es saw their Ss simul 
taneously in different rooms so that communi- 
cation between groups was prevented 
Instructions.—Each S was seen individually 
On the first training day, for all Ss, the E said 
“IT am from a company that makes things for 
boys, and I am going to some of the schools 
Would you help me with these pictures? 
These are coloring pictures. I want to know 
which is the best coloring picture . 
. or this one. 


this one 
Which is the best picture to 
Be 
Would like me to bring you 
a balloon tomorrow? .. . All right, I will bring 
you a balloon tomorrow.” 


color? ... Thanks for helping me. 


you like balloons? 


On the next training day, if S was to receive 
the promised reinforcement, E said: 

“I promised you a balloon today, and here 
it is. . . . Would you help me with these two 


other pictures today? ... Which is the best 


coloring picture?” 

The followed, 
again promising a balloon the next day. 

If S was not to receive the promised rein- 
forcement, FE said, “I am sorry, but I just don’t 
have any balloons today....” He then 
continued with the above instructions. 

On the test day, the instructions given by 
E,, who had trained all the experimental groups, 
were as follows: 


previous instructions were 


“T told you that I was from a company that 
makes things for children. like 
Do you know 
I am going to give you one of 
Would you like that? If you want 
this one (e.g., the airplane) you can have it 
right now. If you want this one (the flying 


.. Do you 
toys? ... Here are two toys. 
what they are? 


these toys. 


saucer), it is the only one I brought, so you can 
have it tomorrow. And I will see your class 
tomorrow to see that each boy got what he 
chose. . . . Do you understand? . . . All right, 
which one will you choose?” 


If necessary, these instructions were repeated. 
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The instructions given by Ep required an 
introduction, since this was the 
with her “Hello, my 
Mrs._—__.. What is your name? I am 
here to see some of the children in your school 

Do you like toys? Fron this 
point on the instructions used by both Es were 
identical, 


first contact 


subgroup: name is 


Control groups.—Control groups were used in 
order to control for possible differences in the 
expectancies for occurrence of delayed reinforce 
These 
groups were given the final choice of immediate 
or delayed reinforcements without any training 
sequence 


ments prior to the training sequence 


One control group was used at each 
of the three schools, and each was given the 


choice of reinforcements by E,. In order to 


investigate the generalization or specificity of 


the expectancy for delayed reinforcements, a 


fourth control group was used at an additional 


school. This group was given the choice of 


immediate or delayed reinforcements by Fy. 
To minimize and control any effects of com- 
munication, the control groups were seen two 
weeks prior to the beginning of training of the 
experimental groups at each school. 


ResuLts 


Three control groups were given the 
choice of reinforcements by E,. The 
chi-square values for comparisons 
among these groups were .205, .O84, 
and .031. Since none of these values 
was statistically significant, these 
three groups were pooled, enabling 
chi-square tests to be made between 
experimental groups and the pooled 
control groups, and also among the 
experimental groups directly 

The basic data are given in Table 1 
and in Fig. 1. A P value of .05 was 
taken as statistically significant. The 
high expectancy group had a sig- 
nificantly greater choice of delayed 
reinforcements than did either the 
moderate expectancy group (.02 > P 
> .O1) or the low expectancy group 
(P < .0O1) when Ex, offered the rein- 
forcements. The moderate expec- 
tancy group had an insignificantly 
greater choice*of delayed reinforce- 
ments than did the low expectancy 
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TABLE 1 


Cuoices or Immepiate AND Detavep ReinrorcemeNtT 


Number } 





Expectancy 


Se Choosing | High Boderate Low } donne Control 
ontro 
Reinforcement - 7 a 8 
: | } 
( trol | Contro 
Ex -| Be |Control) py Ep |\Control! pf, En oe 
Immediate | 6 6 12 5 5 12 18 7 10 ‘4 10 
Delayed | 19 \4 ll il 15 13 6 15 2 6 16 
Total 25 20 23 26 20 25 24 22 22 70 26 


group (.30 > P > .20) when offered 
the reinforcements. 

The high and low expectancy 
groups had a respectively greater 
(.05 > P > .02) and lower (.02 > P 
> .O1) choice of delayed reinforce- 
ments than did their respective con- 


trol groups, when E£, offered the 
reinforcements. There were no sig- 
nificant differences in the choice of 


delayed reinforcements by the high, 
moderate, and low expectancy groups 
when Ey offered the reinforcements 
(80 > P > .50). There were no sig- 
nificant differences in the choice of 
delayed reinforcements by the high 
expectancy group and by its control 


o 80 
°°] 
ie cae 
Fr _ 8 EXPERIMENTAL 
© 704 a“ . 
Q 2 
¥ 4 
5 8 CONTROL 
& 604 . _— 
— A CONTROL 

. Ab 
8 50 | er 
‘4 - 
a * 
8 ao4 A EXPERIMENTAL 
> 4 
% 
' so4 
z 
iw { 
le . 
e 20 Reon —- + - a —_— 4— ———— 

4 2 6 
TRIALS IN WHICH 
PROMISED REWARDS WERE GIVEN 
Fic. 1. Choice of immediate and delayed 


reinforcements by experimental and control 


Ss. 








group (.70 > P > .50) and by the low 
expectancy group and by its control 
group (.70> P> .50), when Ep, 
offered the reinforcements. 

The low and moderate expectancy 
subgroups offered the choice by Ea, 
had a significantly lower choice of 
reinforcements than the respective 
subgroups, offered the choice by Ey 
(P< O01; 05> P> 02). There 
were no significant differences be- 
tween the respective high expectancy 
subgroups (P 30). There were no 
significant differences between the 
pooled control groups, offered the 
choice by Eq, and the control group, 
offered the choice by Ex. 


Discussion 


The results of this study suggest that 
preference strength for a delayed 
reinforcement may be a direct function 
of the expectancy for delayed reinforce- 
ment. The apparent preference for 
immediate reinforcements, and the ap 
parent decline in the effectiveness of a 
delayed reinforcement may, to a large 
extent, be a function of 
expectancy. 

At least three implications may be 
drawn from these results. First, the 
effectiveness of a delayed reward or 
punishment may be increased, not neces 
sarily by shortening the delay, but 
rather by increasing the expectancy that 
delayed reinforcements will occur. Sec- 


a 


a lowered 








EXPECTANCY 


ond, depending on the nature of the 
training, an increased delay may function 
as a cue for a higher, lower, or the same 
expectancy for the occurrence of delayed 
rewards. Third, the preference strength 
for delayed reintorcements may be in- 
creased or decreased by increasing or 
decreasing expectancy while the value 
of the reinforcement remains constant. 
Generalization of expectancy.—The re- 
sults indicated that the effects of training 
with Ex, failed to Ep 
Instead, there were uniform reactions to 
Ep independent of the kind of training 
with Eq. 
form reactions to Ey whether or not the 


generalize to 


Furthermore, there were uni 


Ss had undergone prior training at all. 
Since the training was effective in raising 
the the 
experimental groups, the lack of generali 
that 

already 


or lowering expectancies of 


may indicate 
Ex 
developed, so 
fail to 
influence on the expectancies relating to 
Ep 


According to 


Zation expectancies 
highly 


BE A 


appre iable 


relating to were 
that 


exert 


training with 


would any 


this interpretation, be 
havior in a delayed reinforcement situ 
ation depends on the cues for various 
levels of expectancy for the occurrence 
Specific ally, 


the results suggest that the two Fs served 


of delayed reinforcements. 


as cues for different levels of expectancy 
The implication is that delayed reinforce 
ment behavior in general depends not 
only on the value of the reinforcements, 
but the for the 
occurrence of delaved reinfore ements 4 
related to the The 
effectiveness of a delayed reinforcement 


also on expectancy 


octal agent involved. 


presumably will vary depending on the 


social agent who serves as the cue for 


the expectancy level. 


SUMMARY 


study introduces 


This Rotter’s 


construct as an important factor in delayed 


expectancy 
reinforcement situations. The hypotheses were: 
(a) other factors being equal, the preference 
reward low, 
moderate, or high as the expectancy for the 


strength for a delayed will be 


occurrence of the delayed rewards is respectively 


low, moderate, or high; (6) social agents (i.e., 
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Es) will serve as cues for different levels of 
expectancy. 
The Ss 204 


7 to 9 yr. old, 


were second- and third-grade 


males, To select an immediate 
and delaved reward, a sequential analysis was 
ka then 


trained three experimental groups to develoy 


made of the toy preferences of 70 Ss 


high, moderate, and low expectancies for the 
Three days 


later, one-half of the experimental group was 


occurrence of delayed rewards 


given a choice of an immediate or delayed reward 
by E,. and one half by En, w th whom they had 
had no previous contact 


The results show: 


1, Expectancy for reinforcements 


may be developed or difter yas of 


experiences in which dela w future rein 
forcements either occur or do not occur 
2. In 


expectancy for 


ent or tric 


the leveloy ! lifie atw 


of the 


reinlorcement f other 


behavior 


selecting delayed reinforcements will 


delaye 


factors are equal, the potential for 
ncerceaste 
dec reaae with a relativ ¢ increase of lecre ise 


the expectancy for those delayed re lorcemen 
to occur 
3 In the psychological 


of the 
important cues for the level of the expectancy 
reinforcements is the 
related to the 
the delayed 
The results suggest that d flerent 


situation, one 


for delayed social agent 


perceived to be occurrence of 
non-occurrence of reinforcement 

“ ial avent 
different levels of ex 


may serve as cues for 


pectancy of delayed reinforcement there 
fore, different behavior potentials 


delayed reinforcements over imme 
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STIMULUS PRETRAINING AND SUBSEQUENT 
PERFORMANCE IN THE DELAYED 
REACTION EXPERIMENT 
CHARLES C. SPIKER! 
lowa Child Welfare Research Station 


Several investigators have reported 
studies of the delayed reaction in 
children (1, 2, 6, 10, 11, 12, 14), 
although much remains to be done in 
discovering the relevant experimental 
variables and in specifying the theo- 
retical significance of the delayed 
reaction experiment itself. One of 
the factors that has been suggested 
by Hunter (10, 11) and by Skalet (14) 
as influencing the correct performance 
of children in the delayed reaction 
experiment, as well as in determining 
the length of delay that can be 


tolerated by Ss without interfering 
with the correctness of the perform- 


ance, is the presence of symbolic or 
linguistic responses to represent the 
relevant stimulus features. Skalet 
reported that her child Ss could 
tolerate longer delays with stimuli 
for which the Ss had names (familiar 
animal forms) than for stimuli for 
which the Ss did not have names 
(geometrical figures). She concluded 
that the children’s abilities to repre- 
sent the animal forms with symbolic 
responses in some way facilitated 
recall following delay. 

Although the writer is unaware of 
published studies of the relationship 
between stimulus simiiarity and the 
correctness of the delayed reaction or 
the length of delay that can be 
tolerated by S, this variable is un- 
doubtedly relevant. In the extreme 
case, if identical stimuli were used, S 


The writer wishes to thank his research 
assistant, Miss Ruth B. Holton, for suggestions 
concerning experimental procedures and for aid 
in collecting the data. 
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would have no basis at all for a correct 
response following the delay. It 
conceivable that the animal forms 
which Skalet used were less similar 
than the nonsense figures, and there- 
fore that the better performance with 
the former to stimulus dis- 
tinctiveness rather than to possession 
of names. 

The present experiment is con- 
cerned with a more direct test than 
Skalet’s of the hypothesis that the 
possession by children of names for 
the stimuli leads to more correct 
choices following a delay period. In 
the present design, half the Ss are 
taught different names for two similar 
stimuli in a pretraining session, while 
the other half are given discriminative 
experience with the same stimuli but 
without learning names for them. 
All Ss are then run in a delayed re- 
action situation involving these 
stimuli. Since older Ss, even without 
pretraining, are more likely than 
younger Ss to invent names or other 
descriptive phrases for the stimuli, 
the prior learning of names should be 
relatively more effective for younger 
Ss than for older Ss. Accordingly, 
each experimental group consisted of 
a younger and an older subgroup in 
order to study the interaction between 
age and type of pretraining. 


is 


is due 


Metuop 


Apparatus.-The primary apparatus con- 
sisted of a round wooden disc, 18 in. in diameter, 
mounted on the shaft of an electric motor. 
Fastened to the top surface of the disc, one 
opposite the other, were two identical wooden 
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boxes, 4 in. on each side, with hinged covers. 
The hinges were placed on the outsides of the 
boxes on the sides toward the center of the disc. 
The tops of the boxes were covered with a 
transparent acetate, under which the stimuli 
could be placed, and a wad of white cotton was 
put in the bottom of each box to prevent the 
rattling of the rewards placed in the boxes. A 
normally open push-button microswitch was 
wired into the circuit of the motor so that the 
motor could be started or stopped by pressing 
or releasing the switch. A section of rubber 
garden hose was mounted on the outer rim of 
the dise in order to protect S from the rotating 
disc and to provide £ with a smooth area to 
grasp in stopping the disc. ‘The entire apparatus 
was covered with a heavy coat of flat black paint. 

"There were two stimuli, each consisting of a 
set of black-lined concentric circles placed on 
white backgrounds 4 in. square. One of the 
stimuli (S-7) had 7 concentric circles, the other 
(S-5) had only 5. The inner circle in each 
stimulus was 4 in. in diameter; the outer circle, 
3in. The remaining circles were spaced equi- 
distant between the and inner circles. 
Thus, the two stimuli differed in the number and 
density of concentric circles. 


outer 


Two additional 

pairs of the stimuli, identical to those described 

above, were used in the pretraining session. 
Experimental design 


The experiment was 
conducted in four distinct phases, all given to 


each S on the same day. In the first phase, the 
pretraining stage, the stimuli were presented 
two at a time and Ss in one group (Group N) 
were required to call S-5 by the name “five” 
and S-7 by the name “seven.” ‘The second 
group (Group D) was required to discriminate 
the stimuli by saying “same” when both stimuli 
were 8-5 or S-7, and “different” when one 
stimulus was S-5 and the other S-7, In the 
second phase, / familiarized Ss with the delayed 
reaction experiment by putting an orange 
stimulus on one box and a blue stimulus on the 
other, baiting one of the boxes with a toy coin, 
spinning the disc briefly, and then asking S to 
find the coin. In the third phase, the test 
problem, the concentric circles were used as 
stimuli on the boxes, one box was baited each 
time, the dise was rotated rapidly for 25 sec., 
and S was then asked to find the coin. Finally, 
in a check phase, concentric circles were removed 
from the boxes, one of the boxes was baited, the 
dise was briefly rotated, and S was asked to find 
the coin. This last step was taken in order to 
control for the use of extraneous cues by S. 
Subjects.—The Ss were 54 children from the 
Iowa Child Welfare Research Station preschool 
laboratories. They were divided into two age 
groups; the younger group of 26 Ss ranging from 
3 yr.,9 mo, to 4 yr., 9 mo.; the older group of 28, 
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from 4 yr., 10 mo. to 5 yr.,6 mo. One-half of 
each age group was randomly assigned to ( 7rOUp 
N, the other half to Group D. Seven additional 
Ss were eliminated from the experiment following 
failure to learn the pretraining task in the 72 
trials allowed. Of these, 4 were in Group N and 
3 were in Group D. 

Most of the Ss had previous experience in 
name-learning experiments and about 12 had 
previous experience in a delayed reaction experi 
ment. In neither were the 
stimuli and the names of the present experiment 
used, 


case, however, 


Procedure.—The S was brought 
experimental room by a tamiliar £, 


into the 
The £ lay 
out two pairs of the concéntric circle stimuli on a 
table before S. Group N Ss were told that the 
name for S-5 was “five” 
S-7 was “seven.” 


and that the name for 
The stimuli were then pre- 
sented two at a time and S was informed that 
he could tell the difference between 
“five” 


“seven” and 
The Ss were 
not urged to count, although a few Ss in both 
groups did count. Four different stimulus 
settings were used: in one, both stimuli were S-5; 
in a second, both were S-7; in a third, S-5 was on 
the left, S-7 was on the right; and in the fourth, 
S-7 was on the left, S-5 on the right. The 
presented in a prearranged order 
such that in each successive block of four trials, 


by the number of circles 


stimuli were 


each of the four settings appeared once. A 
trial was counted correct only if S named both 
stimuli correctly. Group D Ss were treated 
like those of Group N except that they were not 
told the names of the stimuli and their task was 
structured for them as saying “same” when both 
stimuli were S-5 or S-7, and “different” when one 
stimulus was S-5 and the other S-7. The 
stimuli were presented in the same order for 
Group D as for Group N. The Ss in both 
groups were continued in the first phase until 
they made correct responses on 12 consecutive 
trials, or until 72 trials had been given. In the 
latter case they from the 
experiment. 


were eliminated 

In the familiarization phase, S was taken to 
the apparatus described above. A 4-in. square 
piece of orange construction paper was placed 
on top of one of the two boxes, and a similar 
piece of blue paper was placed on the other. 
While S watched, £ dropped a toy coin onto the 
wad of cotton in the bottom of one of the boxes 
and closed the lid. The £ then asked S which 
box contained the coin. If S responded cor- 
rectly, E immediately spun the dise rapidly and 
stopped it. ‘The time required to spin and stop 
the disc was ordinarily about | sec. When the 
disc was stopped, S was asked to find the coin. 
This procedure was repeated with the orange 
and blue stimuli, in an irregular alternation, 
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until S had made 


responses. 


three consecutive correct 


Before beginning the test problem, S was told 
that he would now have an opportunity to win 
more toy coins, and that when he had won 
enough he would be able to buy a toy with them. 
The orange and blue stimuli were replaced with 
the concentric circles. Six trials were given to 
each S with the coin being placed in one of the 
two boxes in the order: S-5, S-7, S-7, S-5, S-7, 
and S-5. The £ would bait the appropriate box 
for that trial and would then ask S to point to 
the baited box in order to assure that he had 
observed the baiting. Group N Ss were also 
asked to name the baited box. When S had 
immediately spun the 
dise and kept it spinning for 


responded correctly, E 
25 sec. at a speed 
that made it impossible for S to follow the baited 
box visually. At the end of the 25-sec. period, 
E immediately stopped the disc and asked S to 
find the The between-trial 
averaged approximately 1 min. 


com interval 

The check problem was given in order to 
determine whether or not S’s delayed reaction 
was based on extraneous cues. ‘The discriminal 
stimuli were removed from the boxes and S was 
given 6 trials with a delay of about 1 sec. as in 
the second phase. The Ss’ choices were re 
throughout the 
The 


were 


corded on prepared sheets 
delayed reaction phases of the experiment. 
Ss’ verbalizations regarding the stimuli 


recorded throughout the experiment 


RESULTS 


The number of trials 
criterion of 12 


Pretraining. 
to the 
correct responses was determined for 
each S. The means and SDs of these 
are Table 1. It is 
apparent that both the younger and 
older Ss of Group D reached the 
criterion in fewer trials than did those 
of Group N. Despité sizable differ- 
ences in the means, a treatment by 
levels analysis of variance failed to 
show significant differences in the 
main effects for age or pretraining or 
in their interaction for either the raw 
data or a square-root transformation 
of the raw (P > 10). With 
some caution, it may be assumed that 


consecutive 


presented in 


data 


the two pretraining groups had ap- 


proximately with 


the 


equal experience 


stimuli during the pretraining 
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task. In event, all Ss had 
demonstrated discrimination between 


any 


the stimuli at the same high response 
criterion.” 

Only nine incorrect responses were 
made during the familiarization phase, 
and no S made more than 
correct choices prior to reaching the 


two in- 


criterion of three consecutive correct 
responses. 

The mean 
number of correct choices in the six- 
trial check for cues is 
presented in Table | for each group. 
A mean of 3.0 is expected on the basis 
of chance alone. It is apparent that 
none of the four groups differs 
markedly from the expected mean and 
this observation is borne out 


Test for extraneous cues. 


extrancous 


stalis- 
tically in a treatment-by-levels analy- 
sis of variance. None of the main or 
simple effects is significant at less than 
the 10% level. ‘Thus, it seems safe to 
assume that Ss were unable to respond 
correctly, even with short delay, 
without the use of the discriminal cues 
on which they had received pretraining. 

Test phase. Table 1 the 
means and SDs of the number of 
correct responses for the four groups 
during the critical test phase of the 
experiment. From this table, it is 
apparent that Group N, on _ the 
average, performs better than Group 
D; that the older Ss perform better 
than the younger Ss; and that the 
younger Ss in 


gives 


Group D_ perform 
considerably worse than Ss in any 
other subgroup. A_ treatment-by- 
levels analysis of these data is sum- 


marized in Table 2. The significant 


? Correlations obtained between the number 
of pretraining trials to criterion and the number 
of correct responses on the test suggest that even 
if the difference were significant, it could be 

The 
variwu 
groups were: D—O, — 05;N-—O, — 05; D-—Y, 
— .27; and N-—Y, .03 Ihe correlation for a 


Ss cx mbined was ~ 17 


safely disregarded for present purposes 
Pearson correlation coefficients for the 
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TABLE 1 


Means ann SDs iw THe Paerrarnine, Tue Test PHase, ANd THE 
Cuecx ror Extrangous Cues 


Pretraining Phase: 
Trials to Criterion 


Group 


Young 
Old 
Young 
Old 


interaction of age and pretraining 
suggests the need for an examination 
of the simple effects. The difference 
between means for Group N and D 
was evaluated by a ¢ test separately 
for the older and younger groups. 
For the older Ss,t = .20 and P > .20. 
For the younger Ss, t was 3.96, 
P < OO1. Thus, it appears that the 
effectiveness of the name-learning was 
restricted primarily to the younger Ss. 

Additional evidence for the rele- 
vance of names to the performance in 
the delayed reaction situation is pro- 
vided by an examination of the records 
of Ss in Group D. The E recorded 
verbalizations of 8 Ss in this group 
who used discriminal names to refer 
to the concentric circles (e.g., “bigger 
one,” “shiny “darker one,” 
“wider one,” etc.). The mean number 
of correct responses on the test for 
these Ss was 5.38 and the mean for 


9 


one, 


TABLE 2 


ANALYsis or VARIANCE oF THE NUMBER OF 
Correct Cuorces in Six-TRriat 
Test Session 


Source 


Pretraining (P) 
Age (A) 
PxXA 

Within cells 
Total 


Extraneous 
Cue Check 
Correct Choice 


Test Phase 
Correct Choices 


Mean 
2.92 
5.00 
4.77 
5.14 


Mean 


the remaining 19 was 3.42. The value 
of t obtained was 3.15 with P < .005. 


Discusston 


The results indicate clearly that the 
pretraining requiring Ss to learn names 
for the discriminal stimuli resulted in a 
performance in the delayed reaction test 
that was superior to the performance of 
Ss who had pretraining that involved 
only discrimination of the stimuli. The 
superiority of Group N appears due 
primarily to the younger Ss, since the 
older Ss in the two pretraining groups 
did not differ markedly. This inter- 
action of the pretraining with age levels 
may be attributed to the fact that older 
Ss of Group D were more inclined to 
invent names for the stimuli than were 
the younger Ss. Six of the 8 Ss in 
Group D who spontaneously verbalized 
names for the stimuli were in the older 
age level. On the other hand, the 
experimentally introduced names pro- 
vided the younger Ss of Group N with 
symbolic responses to the stimuli that 
they might not otherwise have attained. 

There are two alternative, although 
not necessarily incompatible, explana- 
tions for these results. The first makes 
use of the hypothesis of acquired 
distinctiveness of cues (3, 8,9, 13) which 
assumes that the learning of distinctive 
(verbal) responses to similar stimuli 
decreases the interstimulus § generali- 
zation in subsequent learning of other 
discriminal responses. In the present 
experiment, the concentric circles were 
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made functionally more dissimilar for 
Group N by virtue of the distinctive 
names that had been learned for them. 
According to this explanation, Skalet’s 
(14) animal forms may have been more 
distinctive than the geometric figures 
because the children had names available. 

Another explanation of the findings is 
offered by Skalet’s suggestion that the 
symbolic responses of articulate organ- 
isms serve as substitutes for the absent 
crucial stimuli (e.g., 
baited’’). During the delay period, S§ 
reproduces the name for the baited 
stimulus, and following the delay, the 
This 
hypothesis is rather well supported by 
the verbalizations of Ss in Group N 
duri Nine of the 27 
Ss verbalized the name 
of the baited box during delay periods, 
sometimes repeating the name several 
times during a single delay period. This 
verbalization was not observed for Ss in 
Group D. 


“box ‘seven’ 1s 


name serves to direct his choice. 


the test phase. 
spontaneously 


This hypothesis can also be 
used to account for the results of several 
experiments (3, 4, 5, 7, 8,9, 13) in which 
was found that discrimination learning 
problems are learned more rapidly by Ss 
who have had verbal pretraining with the 
relevant stimuli. In this case, the delay 
period that is presumed to be bridged by 
of the names is the intertrial 
interval; that is to the names 
facilitate S’s recall of correct and in- 
correct stimulus response sequences on 
previous trials. 


means 


say, 


SUMMARY 


Fifty-four preschool Ss were divided into two 


groups. One group was given training on a pair 
of stimuli that involved 


for the stimuli. 


learning distinctive 


names The other group was 


given discrimination training with the stimuli 


without being required to learn their names 


Subsequently, Ss were given a delayed reaction 


test involy The Ss 
correct 
choices of the baited stimuli following delay than 
lid Ss who had not learned names, although this 
lifference was due primarily to the younger Ss. 
The results were 

hypothesis that tl 
for the stimuli 


ng the pretraining stimuli. 


who had learned names made more 


nterpreted in terms of a 
¢ possession of verbal names 


permits S to produce a repre 


AND DELAYED REACTION 


sentation of the absent stimuli during the delay 


period, 
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INCIDENTAL LEARNING OF STIMULUS FREQUENCIES 
IN THE ESTABLISHMENT OF JUDGMENT SCALES ! 


ALLEN PARDUCCI? 


University of Oregon 


The judgment theories proposed by 
Helson (2), Johnson (3), and Volk- 
mann (6) have been based in large 
part upon studies employing the 
method of single stimuli. This 
method requires S to judge each 
member in the presented stimulus 
series. While it imposes a procedural 
control necessary for the development 
of simple judgment principles, these 
principles should themselves be tested 
under more representative experi- 
mental conditions. The so-called 
“absolute” judgments of everyday 
life are not typically made by the 
method of single stimuli. A judg- 
ment of the aesthetic value of a 
painting, for example, may be made 
after exposure to many other paint- 
ings, none of which evoked an overt 
judgment. And yet we would assume 
that when such a judgment is made, 
it is influenced by preceding experi- 
ence in ways similar to those in which 
the laboratory judgment is influenced 
by the preceding stimuli in the series 
being judged. 

Experiments have shown that ex- 
posure to isolated, unjudged stimuli, 
interpolated between the judged mem- 
bers of a stimulus series, can have 
systematic effects upon the judgment 
scale (e.g., 1, 2). In the present 
study, Ss are exposed to an extended 
series of unjudged stimuli under in- 
structions to respond to some irrele- 


' This experiment was facilitated by a research 
grant from the University of Oregon. Acknowl 
edgment is due Mr. Terry Kohlhagen for aid in 
the tabulation and analysis of the data and to 
Dr. Leo Postman for numerous helpful sug 
gestions. 

? Now at Swarthmore College. 
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vant characteristic of the stimuli. A 
comparison made between the 
effects of this incidental exposure and 


is 


those of the more traditional exposure 
conditions. These effects are studied 
in terms of Ss’ subsequent judgments 
of a restricted range of stimuli and 
also in terms of other 
learning. 

The establishment or modification 
of a judgment scale involves learning 
in the sense that new judgments are 
given to old stimuli. What 
learned? The fact that judgment 
scales seem to depend upon the range 
(6) or distribution (2, 3, 5) of the 
stimulus series suggests that the 
learning of the stimulus distribution 
may be a crucial part of the judgment 
process. This hypothesis 
ported by evidence that different 
sources of information about the 
stimulus distribution, verbal descrip- 
tion (4) or special kinds of stimulus 
shift (5), cause Ss to judge the stimuli 
as if they had been drawn from the 
distribution suggested by the special 
information. The experi- 
ment compares what Ss learn about 


measures of 


else is 


is sup- 


present 


stimulus frequencies under the usual 
judgment instructions with what they 
learn when exposed the same 
stimuli but under instructions to 
respond to an irrelevant characteristic 
of the stimuli. 


to 


Metuop 


Subjects.—Introductory psychology students, 
300 in number, were assigned to 17 groups of 
from 8 to 29 Ss each. Assignments were made 
according to the times Ss were free to serve 
Materials.—The stimuli 


cards, varying both in size and in the number « 


white 


f 


were square 
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dots which were printed in the center of each 
card. The widths of the cards were as follows: 
2, 24, 3, 33, 44, 54, 64, 74, and 9 in. The dots 
each had a diameter of 4 in. The patterns in 
which they appeared formed a right triangle, 
12 in. on both the vertical and horizontal sides, 
the right angle being closest to the lower-right 
corner of the card. The number of dots in the 
respective patterns was as follows: 3, 4, 5, 6, 7, 
9, 12, 15, and 19. Each pattern appeared 
equally often on each of the different-sized cards 
so that there was no correlation between the two 
stimulus dimensions. For purposes of identi 
fication, the respective stimuli on both dimen- 
sions will be referred to by the numerals | 
through 9 (in order of increasing size or density) 

Procedure.-Each group (the respective 
groups were run at different hours) was given 
response booklets in which to record the judg- 
ments and then read a set of judgment instruc- 
tions, like those in (5S), appropriate to the 
stimulus dimension on which they were to make 
their preshift judgments. The  size-judging 
groups, I, III, VI, VIII, [X, and XII in Table 1, 
made their judgments in terms of a five-category 
size scale. The remaining groups were in- 
structed to judge the numerousness of the dots, 
again in terms of a five-category scale. The 


stimuli were displayed manually to Ss, being 
held one at a time, at 4-sec. intervals, before a 


black, 28 & 44-in. backdrop. 

There were 88 preshift presentations for each 
of the 12 experimental groups. For three of the 
size-judging groups, VI, VIII, and XII in Table 
1, each of the nine different sizes appeared with 
equal frequency (or approximately equal since 
two of the sizes appeared only nine times while 
the other seven were each shown 10 times) 
The other three size-judging groups, I, III, and 
IX, were given 16 presentations of each of the 
five largest cards and only two presentations of 
each of the four smallest cards. The dot 
judging groups were similarly divided. In all 
cases, the different sizes and dot patterns were 
presented in quasi-random order and there was 
no correlation between the size of the card and 
the number of dots appearing upon it. 

After Judgment 85, the groups to be tested 
for their recall of the stimulus frequencies 
(Groups I through VIII) were instructed to 
close their response booklets. Each S in these 
groups was then handed a page containing 12 
frequency-estimation scales and told to record 
an estimate of the relative frequency with which 
each of the respective sizes (or patterns of dots) 
had been presented. Each group had to make 
these estimates in regard to the stimulus di- 
mension on which they would make their post- 
shift judgments ; this meant that half the groups, 
the incidentals, had to recall the stimulus fre- 
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TABLE 1 


Summary or Exrerimentat Conprrions*® 


Post 
shift 
Distri 
butiont 


Preshift 
Distribution 


Frequency 


( 
sTOUp Estimates | 


s1ze—stimuli 
present 


skewed 
rect. 


SIZE 
dots 


SIZE 


DOTS ~——Tect., 
size—skewed 


size—stimuli 
present 
skewed | size—stimuli 
rect. absent 


SIZE 
dots 


poTs —rect. 
size—skewed 


sizE—stimulli 


absent 
pots —skewed 
size-—Tect, 


pots—stimuli 


present 


DOTS 


SIZE 
dots 


rect. 
skewed 


poTs—stimuli 


present 


bpoTSs 


pots —skewed stimuli 


size-—rect, 


DOTS 
absent 


DOTS 


SIZE 
dots 


rect. 
skewed 


poTs —stimuli 


absent 


poTs 


skewed 
rect. 


SIZE 
dots 


boTs—rect. 
size—skewed 


pots——skewed 
size —rect. 


XII | size 


dots 
! 


boTS 


rect. 
skewed 


* Judged dimension in upper case 

* Both size and dot distributions rectangular 
of judged dimension: No. 1 to No. 5; 
mension: No. 1 to No, 9 


range 
unjudged di 


quencies in regard to the irrelevant dimension 
(the one which they had not been judging), 
For half of these groups, each frequency estimate 
was made in the presence of the stimulus whose 
past frequency was to be recalled; for the other 
half, no stimuli were shown during the frequency 
test, Ss making their estimates in the order of 
recalled size (or numerousness of dots). In all 
cases, these estimates were checked on con 
tinuous scales running from “never” (O mm.) 
through “less often than average” (374 mm.), 
“average” (75 mm.), “more often than average” 
(1124 mm.) to “always” (150mm.). Judgments 
of Ss’ confidence in these estimates were also 


checked on the same scales (see results for the 
measure used), 
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After the frequency and confidence estimates 
had been made and the scales collected, all Ss 
were instructed to re-open their judgment 
booklets and to record their judgments of the 
remaining stimuli either in terms of the original 
size or dot categories (intentional groups) or in 
terms of the alternative set of categories (inci- 
dental groups). ‘The distribution of the stimuli 
on the dimension being judged was shifted so 
that it now included only stimuli No. 1 through 
No. 5, the five smallest cards (or least-dense 
patterns). ‘There were 110 of these postshift 
presentations, and each stimulus (in terms of 
the relevant dimension) appeared 22 times. The 
relevant stimulus values in both pre- and post- 
shift series had the same distributions as those 
for the NR groups in (5) where the preshift 
series had provided an effective anchor for the 
postshift series. All nine of the irrelevant 
stimulus values (sizes or dot patterns) appeared 
with approximately equal frequency. Again, 
there was no correlation between the two 
stimulus dimensions. 

The four remaining experimental groups, LX 
through XII, shifted directly to the postshift 
series without making the interpolated frequency 
estimates. These groups were also divided 
between intentional and incidental and between 
card-judging and dot-judging conditions. The 
experiment thus followed a 3 X 2 X 2 factorial 
design. There were three conditions for the 
frequency and confidence estimates: (a) test 
with stimuli present, (b) test with stimuli 
absent, and (c) notest atall. A second variable, 
the judgment instructions, had two conditions: 
(a) incidental—frequency estimates and post- 
shift judgments made in terms of the dimension 
which had been irrelevant during the preshift 
series, and (b) intentional—all judgments made 
in regard to the same stimulus dimension. The 
third variable, the nature of the stimuli, also 
had two conditions: (a) frequency estimates and 
postshift judgments made in regard to size, and 
(b) frequency estimates and postshift judgments 
made in regard to dots. All of the experimental 
conditions are summarized in Table 1. 

Five additional groups served as controls. 
These first judged either the size or dot postshift 
series. They were then instructed to make 
frequency estimates in regard to either the 
judged or unjudged dimensions. Groups IX 
and X also made frequency estimates following 
the presentation of their postshift series. The 
specific control conditions will be described in 
greater detail when the comparisons between the 
experimental and control responses are made in 
the next section. 


ResuLts 


Incidental learning 
scales.—The postshift judgments were 


of judgment 
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converted to numerical form: very 
large (very many) = 5; large (many) 
= 4; medium (average) = 3; small 
(few) = 2; very small (very few) = 1. 
To simplify tabulation, only the 
judgments of the middle stimulus, 
No. 5, were used. Judgments of this 
stimulus showed the greatest change 
in the course of the postshift series. 
Two measures of each S’s postshift 
judgments were computed: (a) his 
initial postshift level, the mean of his 
first three postshift judgments of No. 
5, and (b) his total postshift change, 
3/10 of the sum of his last 10 post- 
shift judgments of No. 5 minus the 
sum of his first three judgments of 
No. 5. 

What effect did incidental vs. in- 
tentional exposure to the preshift 
series have upon the postshift judg- 
ments? As sh-~1 in Table 2, the 
incidental grou x. each case judged 
stimulus No. 5 .o be larger (or as 
having greater numerousness of dots) 
than it was judged by the corre- 
sponding intentional groups. They 
also showed, in five of the six com- 
parisons, greater total change, i.e., 
greater adjustment to the postshift 
position of No. 5. This stimulus had 
been at the middle of the preshift 
range, but it was the largest (or the 
one with the most dots) in the post- 
shift series. Thus, the lower judg- 
ments by the intentional groups 
reflected a greater anchoring effect 
by the intentionally learned preshift 
series. The statistical significance of 
this effect was indicated by analysis 
of variance (following the procedure 
described for unequal N’s in 7), the 
effects of incidental vs. intentional 
preshift exposure being significant 
(P < .01) in both the initial-level 
and total postshift change analyses 
(total-change scores logarithmically 
transformed to reduce heterogeneity 
of variance to P > .10). 

Additional information concerning 
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TABLE 2 


Mean Postsnirt JupGments 


Intentional 


Stimulus Initial Level 


oes —_— 
Group N Mean 


Total Change* 


SD Group 


Incidental 


Initial Level Total Change* 


Mean 


Frequency Test with Stimuli Present 


13 | 2.61 | 6 


768 | 152} | 14 /3a7| 61 


.770 


29 | 2.82 | 4 | -731 065 | vi | 27 | 3.21 | .52 | .833 
j | | 


Frequency Test with Stimuli Absent 


15 | 2.95 | .44 7 | 106 | Iv | 15 | 3.91 S4 


m% | 289 | 51 | 801 | 166 


vit | 27 | 3.73 | 68 


No Frequency Test 


18 | 2.33 | Al | .733 


Size 
: ub | 2.53 i 742 


Dots 


* Each score = log (total postshift change + 5.0). 


the effect of the preshift experience 
upon the postshift judgment scales 
was obtained through comparison 
with control groups whose first judg- 
ments were either of the postshift size 
or postshift dot series. In both cases, 
these control means were significantly 
higher (P < .001) than the means of 
the corresponding intentional groups 
not receiving an interpolated fre- 
quency test (Groups IX and XI). 
The same comparison for the corre- 
sponding incidental groups (X and 
XII) yielded a statistically significant 
t for the dots (P < .01) but not for 
the sizes (P > .40); though here too 
the difference was in the correct 
direction. There is thus evidence for 
the incidental learning of judgment 
scales, but such learning appears to 
be less effective than intentional 
learning. 

Incidental learning of stimulus fre- 
quencies.—Since the larger stimuli (or 


B49 


| 906 


the ones with more dots, depending 
upon which dimension would later be 
tested) were presented more fre- 
quently during the preshift series, 
learning of the stimulus frequencies 
would be indicated by greater fre- 
quency estimates for these stimuli. 
For each S, a frequency-estimate 
score was obtained by subtracting the 
sum of his estimates (distance in 
millimeters between his frequency 
check and the “never” mark on each 
scale) for stimuli No. 1 through No. 4 
from the sum of his estimates of 
stimuli No. 6 through No. 9 (com- 
parable measures for Ss tested in the 
absence of the stimuli being based on 
the differences between the sums of 
the estimates for the first and last 
half of the scales, adjusted in each 
case for the total number of scales 
used). 

What were the differences in fre- 
quency learning as a result of inci- 
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dental vs. intentional exposure to the 
stimulus series? For each of the four 
appropriate comparisons, the inten- 
tional group had a higher frequency- 
estimate score than the corresponding 
incidental group (see Table 3). This 
greater accuracy in discrimination of 
the skewness of the preshift series was 
statistically significant (P < .01 in 
an analysis of variance performed 
upon these scores). 

Was there any incidental learning 
of stimulus frequencies? Comparison 
with control groups tested following 
incidental exposure to the rectangu- 
larly distributed series (sizes or dots) 
indicated that when the frequency 
test was made in the presence of the 
respective stimuli, even the incidental 
groups showed significant discrimi- 
nation of skewness of the series 
(P < 01). However, the evidence 
for such discrimination fell short of 
customary statistical standards when 
the frequency estimates were made in 
the absence of the stimuli (P > .05). 
This dependence of the results upon 
the method of test was reflected in 
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the analysis of variance of the experi- 
mental groups’ frequency-estimate 
scores by significant mean squares 
associated both with the method of 
test and with the interaction between 
the test method and the nature of 
preshift exposure (P < .05 and .OI, 


respectively). 

Incidental learning of stimulus 
range.—Since stimulus range may be 
a major determinant of judgment (7), 
stimuli one step below and one step 
above the regular series were also 
displayed during the frequency-esti- 
mate test for Groups I, II, V, and VI. 
These stimuli had not previously been 
shown, and therefore smaller fre- 
quency estimates for them would 
indicate more accurate discrimination 
of the stimulus range. An analysis 
of variance performed upon these 
estimates indicated that the inten- 
tional learning was again superior to 
incidental learning (P < .01); but 
even for the incidental groups, the 
mean estimate of these stimuli was 
significantly better (P < .OO1) than 
chance (which would have been a 


TABLE 3 


Mean Frequency Estimates ann Conripence Ratincs 


Intentional 


Frequency 


F Z 
Stimulus Ketimates 
| 
| 
| 


/ | 
N | Mean | 


- | N [Mean 


Confidence 
Ratings 


Incidental 


Frequency 
Estimates 


Confidence 
Ratings 
Group! 


SD | 


| 
N Mean SD 


Frequency Test with Stimuli Present 


| y| | 25 [280.1 71.3 2 | 
| 


| 
| 13 | 168.5] s00 | 12 | 34.27 | 14.3 | 
| ae Racy 


81 | 


Wh 16 “7ra) 81.5 
| 24 | 


93) V 134.3) 82.5, 


ene Test with Stimuli Absent 


“Dots 


vil) i) 0 for 82.5 | 21 ‘a 87) Vutt| 7 


12 | 49.9 14.6 
28 54.8 | 


90.4 | 88.6 


13.0 
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check at “with average frequency,” 
the mid-point of the frequency-esti- 
mate scale). 

Confidence estimates.—For each fre- 
quency estimate, S was asked to 
indicate with two vertical lines the 
range of relative frequencies he would 
accept as possible for that stimulus. 
Thus, the shorter the distance between 
lines, the greater S’s confidence in 
his frequency estimates. Mean con- 
fidence ratings were computed for 
each S. As shown in Table 3, each 
intentional group was more confident 
than the corresponding incidental 
group, the difference exceeding the 
OL level in an over-ail analysis of 
variance. 


Discussion 


The evidence that a judgment scale 
may be established through incidental 
exposure to a series of stimuli, while 
perhaps not surprising, does suggest a 


certain generality for principles of judg- 
ment which have been based upon studies 
using the traditional method of single 


stimuli. However, incidental exposure 
has less of an effect upon the judgment 
scale than exposure under instructions 
to judge. This finding extends the 
results obtained by Helson (2) and 
Brown (1) using interpolated, unjudged 
stimuli without distracting cover tasks. 
It thus appears that stimulus 
exposure, without any set or instructions 
in regard to the stimulus dimension that 
will subsequently be relevant, has de 
monstrable effects upon judgment. 

The results also suggest that specu- 
lation about judgmental processes can 
be bolstered with independent measures 
obtained in the judgment situation. 
They show that Ss acquire knowledge 
of both the range and the relative 
frequencies of stimuli to which they are 
exposed. Such knowledge is more ac- 
curate when Ss have been instructed to 
make absolute judgments of the stimuli 
than when their attention has been 
directed to an irrelevant stimulus di- 


mere 


117 


mension, and they know that it is. 
This greater confidence in what they 
have learned is paralleled by greater 
resistance to the demands of a changed 
stimulus series. 

This last finding, while predictable on 
a common-sense basis, would seem to 
offer some difficulty for a general 
quantitative theory of judgment. 
Simple differential weighting (2, p. 5; or 
3, p. 349) could handle the greater 
initial anchoring effect of the judged 
preshift series, but a much more complex 
formulation would seem necessary to 
describe the greater postshift change by 
the incidentally exposed groups. Nor 
is this finding predictable from the 
general hypothesis developed in (5) 
that postshift change will be greater 
when the postshift series is a less likely 
sample from the preshift distribution; 
for here the respective stimulus distri- 
butions are the same for all groups. 

One might speculate that the inci- 
dentally exposed Ss, having less con- 
fidence in what they had learned about 
the preshift stimuli, quickly forget 
during the postshift that the 
missing stimuli have ever been shown. 
Frequency estimates obtained after the 
postshift series would provide a test for 
such However, the fact 
that there was little evidence for fre- 
quency learning when the test was made 
in the absence of the stimuli suggests 
limitations for both £’s speculations and 
S’s retrospections. 


serics 


speculation. 


Hypotheses, strate 
gies, or expectancies appear much more 
elusive than common sense would have 


them—existing, perhaps, only as ex 


tremely limited response potentialities. 


SUMMARY 


This experiment studied the effects of inci 
dental exposure to an extended series of stimuli 
An attempt was made to assess what Ss learned 
about the stimuli both in terms of their estimates 
of the respective stimulus frequencies and also 
in terms of their absolute judgments of subse 
quently exposed stimuli. These indexes of 
learning were compared with those for Ss 
exposed to the same series, but under instructions 
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to judge each stimulus in terms of the subse- 
quently relevant stimulus dimension. 

The results indicated that judgment scales 
may be incidentally learned but that such 
learning is less effective and also more subject 
to interference. Instructions to judge the 
stimuli facilitated learning of both their range 
and relative frequencies, and also increased Ss’ 
confidence in their own frequency estimates. 
These results are consistent with the assumption 
that establishment of an absolute judgment scale 
involves the acquisition of information about 
the distribution of the stimuli presented for 
judgment. 
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THE INFLUENCE OF 


SIMILARITY, 


MEANINGFULNESS, 
AND SERIAL 


INTRALIST 
POSITION ON RETENTION 


BENTON J. UNDERWOOD AND JACK RICHARDSON 


Northwestern University * 


The first purpose of the present 
study is to determine the influence of 
meaningfulness on recall of rote- 
learned materials. While no ade- 
quate study has been published deal- 
ing with this variable, it is a common 
belief (e.g., 5, 6, 10, 16) that the more 
meaningful the material the better 
the retention. Only Osgood (8, p. 
562) has questioned the assumption. 
Recent findings on recall as a function 
of ability level (13) and intralist 
similarity (e.g., 11) suggest that 
variables which influence rate of 
learning may have little or no in- 
fluence on recall. Our first objective, 
then, is to test recall as a function 
of meaningfulness when response 
strength of items of different levels 
of meaningfulness have been equated 
at the end of learning. 

As indicated above, our previous 
studies have suggested that intralist 
similarity is a minor variable at 
recall. In general we have found no 
difference in recall as a function of 
intralist similarity although in two 
cases (12, 15) the recall was better 
for the high-similarity lists. But, as 
pointed out elsewhere (14), these 
determinations may have been con- 
founded by the reciprocal relation 
between intralist and interlist simi- 
larity when Ss serves in a series of 
conditions as they did in our experi- 
ments. Therefore, we felt it ad- 
visable to make at least one further 
test of the similarity variable using 


1 This work was done under Contract N7onr- 
45008, Project NR 154-057 between North- 
western University and the Office of Naval 
Research. 


Ss who had had no previous formal 
experience with the materials. This 
rules out interlist similarity as a 
source of confounding but does not, 
of course, eliminate extra-experi- 
mental learning as a source. In any 
event, we feel that the present test of 
the influence of intralist similarity on 
retention is as “clean’”’ as it is possible 
to make it using standard verbal 
materials. 

Finally, we wanted to determine 
the influence of serial position on 
recall. Evidence from Hovland (4) 
indicates that on recall 24 hr. fol- 
lowing learning, items at each serial 
position are recalled about equally 
well, but that on the immediately 
succeeding trial the classical bowed 
curve is quite evident. However, 
Hovland’s findings at recall may be 
heavily influenced by proactive inhi- 
bition from the many previous lists 
learned by his Ss. Furthermore, only 
if items at different serial positions are 
equated for response strength at the 
end of learning can the effect of serial 
position on recall be evaluated un- 
confounded by differences in response 
strength which probably are present 
for items at different serial positions 
at the end of learning. 


Metuop 


Materials.—Four serial lists of nonsense sy! 
lables of 10 items each were used. Two of the 
lists were made up of items of from 93.3% to 
100% association values and the other two from 
among items of from 0.0% to 20.0% association 
value (2). One of the two lists at each level of 
meaningfulness had high intralist similarity, one 
low intralist similarity. ‘The two lists with high 
intralist similarity had only four consonants, 
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TABLE 1 


Lists Usep 


High Meaningfulness Low Meaningfulness 


High 
Similarity 


(HM-HS) 
BES 


High 
Similarity 


(LM-HS) 


Low 
Similarity 


(HM-LS) 


Low 
Similarity 








(LM-LS) 
CEF 











each being used five times. In the lists with low 
intralist similarity each consonant was used only 
once. In each list each vowel was used twice. 
The four lists are given in Table 1. Letting H 
stand for High, L for Low, S for Intralist 
Similarity, and M for Meaningfulness, the four 
lists may be symbolized as follows: HM-LS, 
HM-HS, LM-LS, LM-HS. 

Ten orders of items for each list were prepared 
such that each syllable occurred equally often in 
each serial position. Each order was used an 
equal number of times so that bias due to a 
particular syllable being in a particular serial 
position should be minimal. The list was 
always preceded by three asterisks as the an- 
ticipatory cue for the first syllable. 

Subjects and procedure.—The four different 
lists represent the four conditions of the experi- 
ment. A total of 400 Ss served in the experi- 
ment, 100 for each list. The use of this rather 
large number of Ss is necessary for the type of 
analyses we wish to make and does not indicate 
a repudiation of small-sampling procedure. On 
E's schedule sheet the order of the four lists was 
random with the restriction that each occurred 
100 times. ‘The Ss were then assigned in order 
of appearance at the laboratory. Ten Ss were 
lost for failure to learn the most difficult list 
(LM-HS) and one for failure to learn the LM-LS 
list. None was lost for failure to learn the other 
two lists. Therefore, there is some selection of 
Ss in that the 100 Ss completing the LM-HS list 
probably were of higher average rote-learning 
ability than were those learning the other lists. 
Consequently, differences in rate of learning 
among lists are probably reduced when compari- 
sons are made between LM-HS and any of the 
other three lists. ‘This selection, however, has 
no effect on the retention data, as will subse- 
quently be shown. 


All Ss were naive to the formal rote-learning 
situation. Each S learned the list to which he 
was assigned to one perfect trial. Recall and 
relearning to one perfect trial took place 24 hr. 
after learning. Lists were presented at a 2-sec. 
rate on a Hull-type drum with 8 sec. between 
trials. All syllables were spelled. 


RESULTS 


Learning.—The essential learning 
data are shown in Table 2. In terms 
of mean or median number of trials to 
learn, the effects of the two variables 
are evident and confirm a number of 
previous studies; the higher the 
meaningfulness the faster the learning, 
and the lower the intralist similarity 
the faster the learning. Because of 
heterogeneity of variance which was 
not corrected by transformations, we 
have used the median test (7, p. 402). 
Both intralist similarity and meaning- 
fulness are significant beyond the 1% 
significance level and the Similarity 
X Meaningfulness interaction is sig- 
nificant between the 5% and 1% 
levels. The interaction indicates that 
the lower the meaningfulness the more 
effective is intralist similarity in re- 
tarding learning. 

The analysis of variance on mean 
errors per trial shows that meaning- 
fulness does not influence the number 
of errors made. Similarity, on the 
other hand, gives an F of 5.03 (.05 
> P> Ol). However, as can be 
seen from Table 2, this difference is 
completely produced by the lists of 


TABLE 2 
Triacs To LEARN AND Errors per Trial 
Duainc LEARNING 
‘ : ade | 
Trials | 
Mean | om | Median | Mean | 


i—- 


Errors 


HM-HS | 25.86/| 1.18 | 2 
HM-LS | 19.27| .92 | 1 
LM-HS | 42.57 
LM-LS | 29.18 


5 1.47 | 
3 | 147 
8 | 1.61 
5 | 1.32 


1.72 


4. 
7. 
1.54 | 27. 
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TABLE 3 
Numper Correct at Recatt anp NuMBER 
or Trias TO RELEARN 


Relearning 


4.89 
3.71 
7.59 
5.23 


low meaningfulness, where the 
greatest number of errors occur with 
high intralist similarity. This is sup- 
ported by an F of 5.00 for the Mean- 
ingfulness X Similarity interaction. 
In a previous study (11) using syl- 
lables in the middle range of associa- 
tion value, no difference in error 
frequency as a function of intralist 
similarity was found in spite of large 
differences in rate of learning. It 
appears, therefore, that unless the 
material has low meaningfulness, no 
difference in error frequency as a 
function of intralist similarity may be 
expected in serial lists. Also, it may 
be worth noting that the relative 
error frequency among conditions 
during learning and the relative recall 
among conditions (Table 3) corre- 
spond closely. 

Although meaningfulness does not 
influence error frequency, it is possible 
that the distribution of the errors in 
the course of learning could be dif- 
ferent. However, when Vincent-type 
curves were plotted for each list no 
appreciable differences among the 
curves were apparent. In all four 
cases the error frequency increased 
somewhat during the first third of 
learning, remained about constant for 
the second third, and decreased for 
the last third. 

Recall.—The recall and re- 
learning scores are shown in Table 3. 
Neither meaningfulness nor intralist 
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similarity significantly influences re- 
call. Analysis of variance shows that 
the Meaningfulness X Similarity in- 
teraction just reaches the 5% level. 
This indicates that with low meaning- 
fulness intralist similarity influences 
recall whereas with high meaningful- 
ness it does not. The recall is less 
for high intralist similarity than for 
low. 


Since wide differences in rate of learning of 
the different lists were evident, analysis of the 
raw recall scores is inappropriate as a means of 
getting the best possible estimate of the influence 
of the variables (13). Our major analyses, 
therefore, are based on probability analyses of 
acquisition curves as a means of adjusting for 
different response strengths at the end of learning 
so that recall can be evaluated without bias due 
to different initial strengths for the items in the 
different lists. Before turning to these analyses, 
however, we must dispose of the problem of 
different ability level of the Ss learning the lists. 
As indicated previously, Ss learning the LM-HS 
List probably had higher ability level than those 
learning the other three lists. While we have 
shown elsewhere that ability level is not related 
to retention (13), we have made corresponding 
analyses for the 100 Ss who learned each list 
in the present study. For each group of 100 Ss 
the mecian of the learning scores was found and 
then the 50 rapid-learning Ss were compared on 
recall with the 50 slow-learning Ss. In no case 
was there a difference in recall when response 
strengths at end of learning were equalized. 
Therefore, the influence of the major variables 
on recall can be determined without bias due to 
differences in ability level of the Ss in the four 
groups. 


In determining the amount lost 
over the 24-hr. retention interval, we 
first constructed successive proba- 
bility curves for the learning of each 
list. For each S for each list an 
estimate was made of the number of 
correct responses he would have 
gotten had there been another learn- 


ing trial. This estimate was based 


on the successive probability curve 
for all Ss in the group. 


Then for 
each S the actual number of items 
correct on recall was subtracted from 
the number expected to give a loss 
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TABLE 4 


Mean Loss Scores 





List 








HM-HS | HM-LS | LM-HS | LM-LS 
Mean 141 1.48 1.90 6 
om 20 22 20 19 














score. Details and rationale of this 
procedure have been presented else- 
where (9). Table 4 gives the mean 
loss score for each of the four 
conditions. 

We note first that, with high 
meaningfulness, intralist similarity 
has little influence on forgetting, but 
that with low meaningfulness the loss 
is greater for the list of high intralist 
similarity than it is for the list of low 
intralist similarity. This difference, 
however, is a function of two changes 
neither of which is statistically sig- 
nificant separately. First, with high 
similarity there is an increase in the 
loss from high to low meaningfulness. 
Second, with low similarity, there is a 
decrease in the loss from high to low 
meaningfulness. The smallest abso- 
lute loss occurs with the list having 
low meaningfulness and low intralist 
similarity. By analysis of variance 
the effects of Similarity and Similarity 
X Meaningfulness are both significant 
beyond the 5% level of confidence. 

As indicated above, there is a 
tendency for the list of low meaning- 
fulness to be better recalled than the 
list of high meaningfulness when 
intralist similarity is low. However, 
since this does not attain statistical 
significance, we must conclude that 
for the materials used here meaning- 
fulness does not influence retention. 
Since this conclusion is somewhat 
contrary to the usual assumption, we 
present in Fig. 1 the recall as a func- 
tion of strength of items for the 200 
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Fic. 1. Recall as a function of meaningful- 
ness when strength of associations have been 
equalized at end of learning. 


Ss learning the high-meaningful lists 
and the 200 Ss learning the low- 
meaningful lists. The percentage of 
items recalled is plotted against base 
lines which have been adjusted so that 
the strength of items at end of learning 
for the two levels of meaningfulness 
are equal at any given point along the 
base lines (13). If there is any trend 
discernible, it is that the list of low 
meaningfulness is better recalled than 
the list of high meaningfulness. 

We turn next to an analysis of 
recall as a function of serial position. 
In Fig. 2 the mean number of correct 
responses at each serial position has 
been plotted. Contrary to Hovland’s 
results (4), there is a bowed curve 
quite comparable to that obtaining 
during acquisition. More important 
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MEAN CORRECT RESPONSES AT RECALL 
é 


SERIAL POSITION 


Fic. 2. Mean number of items recalled at 
each serial position for all lists combined, 








for our purposes, however, is to 
determine whether the differences in 
recall as a function of serial position 
are largely a function of differences in 
strength of items at end of learning or 
whether something intrinsic to serial 
position accelerates forgetting. To 
do this we have again used the 
probability analysis of learning and 
then have adjusted base lines for 
recall so that expected probabilities 
of immediate recall are the same for all 
serial positions. To add some sta- 
bility to the data we have combined 
Serial Positions 1, 2, and 3; 5, 6, and 
7; and 8, 9, and 10, omitting Position 
4 entirely. We are thus dealing with 
three sections of the bowed curve. 
For each section independent proba- 
bility analyses of learning were made. 
From these, base lines were adjusted 
so that the strength of items at end 
of learning for each section was 
equivalent. The percentage correct 
on recall for each section was then 
plotted along the adjusted base lines 
as shown in Fig. 3. Although there is 
great irregularity, there is no evidence 
that the first and last sections of the 
list are any better recalled than the 
middle section. ‘There is a suggestion 
that the middle items (5, 6, and 7) 
are better recalled than the end items 
when such items are weak. However, 








ns 1 re 
a nah | 
a a’* . 
S NV TV YY 
id ah hoa” r\/ \{ 4 
a ree J a! \/ A 
t ATR V 

eo AA \ 

| ‘ 
14 ving } i. / 
ow \ 
Zep / « s ’ | 
S / 9 ‘ i 
Vv é 
» 20) Gc® Go-mes 
z SEMA POSITIONS 8-7 ome 
rw) 6-0 + . 
i 
A or ee ee ee ee Le 
ws a 3 2 ee ee 
on“ ea ee ae ae a ie “ 7 , x - 
NUMBER OF REINFORCEMENTS 
Fic. 3. Recall as a function of serial position 


when strength of associations have been equal- 
ized at end of learning. 
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it would appear that the most con- 
servative conclusion is that there is no 
difference in retention as a function of 
serial position when strength of items 
at end of learning is equalized. 
Relearning.-The mean number of 
trials required to relearn each list was 
given in Table 3. These values 
reflect almost perfectly the learning 
values as given in Table 2. By 
analysis of variance both meaningful- 
ness and intralist similarity are highly 
significant sources of variance in re- 
learning. Contrary to learning 
scores, however, the interaction be- 
tween meaningfulness and similarity 
does not even approach significance. 


Discussion 


The discussion will be built around the 
implications of the present results for an 
interference theory of forgetting. We 
may first note that the absolute amount 
of forgetting in the present experiment is 
much less than usually reported. For 
example, in Hovland’s study (4) in 
which 12 syllables were learned and 
recalled after 24 hr., the mean number 
recalled was 2.61. In our previous 
study (11) using 13 syllables the mean 
number recalled was approximately 4.5, 
varying in small amounts for different 
conditions. In both experiments S 
served in a number of conditions. In 
the present study the average number 
recalled for all conditions was 6.82 out 
of the 10-item list. Such findings make 
it quite clear that when S§ serves in a 
series of conditions the major source of 
interference is the previous lists learned 
(3, 14). In the present study no previ- 
ous lists had been learned to interfere 
with recall attempts. And, when we 
measure forgetting with expected im- 
mediate recall as a base, the amount of 
forgetting ranges from .96 to 1.90 items. 
It is apparent that compared with usual 
findings not much forgetting has taken 
place, and that when previously learned 
lists are eliminated as a source of inter- 
ference, the amount of forgetting for 
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which an interference theory must ac- 
count is markedly reduced. It should 
also be mentioned that there is little 
evidence of lack of warm-up. For all 
lists combined only 1.3% forgetting was 
observed in the first item in the list and 
1.2% in the second. The first item may, 
therefore, be depressed a very small 
amount due to inadequate set, but it is 
certainly of little consequence for the 
over-all retention of the list. 

Osgood (8) has indicated that to 
maintain an interference theory it must 
be shown that low-meaningful material 
is better retained than high-meaningful 
material. This must be so _ because 
verbal habits learned in everyday ex- 
periences would not as likely interfere 
with low-association syllables as they 
would with high-association syllables. 
In the present study we did not find 
statistically significant differences in 
retention between the high- and low- 
meaningful syllables, but the differences 
were in the expected direction. More 
extreme levels of meaningfulness (such 
as meaningful words os. consonant syl- 
lables) might well show larger differences. 
In any event, it is quite clear that with 
intralist similarity low, serial syllables of 
high meaningfulness are not better 
recalled than syllables of low meaning- 
fulness. 

Gibson’s theory (1) holds that the 
major source of forgetting is the inter- 
ference produced by recovery of intralist 
generalization tendencies with the pas- 
sage of time. As indicated elsewhere 
(14) we have found this a very fruitful 
notion provided that the interference 
produced by recovery of generalization 
is extended to include previous lists and 
previous extra-experimental learning. 
Earlier attempts to support the theory 
were clouded by the confounding of 
interlist and intralist similarity. In the 
present experiment interlist similarity 
was eliminated. So far as we know, the 
difference in recall for high and low 
intralist similarity for the low-meaningful 
lists is the first clear-cut substantiation 
of the Gibson hypothesis. According to 
the theory there is recovery of generali- 
zation among similar items over the 24-hr. 
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interval introducing more interference at 
recall than in the case of a list with low 
similarity. But why doesn’t the same 
difference obtain for the lists of high 
meaningfulness? Two possibilities seem 
most likely. First as implied in Gibson’s 
theory, the more meaningful the material 
the less rapid the spontaneous recovery 
of generalization. This would mean 
that for longer retention intervals than 
used here we would predict that sooner 
or later the high-similarity list would 
show greater forgetting than the low- 
similarity list. The second possibility 
is that with high intralist similarity there 
is less interference from previous learn- 
ing. An experiment in which high- 
similarity lists were better retained than 
low-similarity lists suggest this possi- 
bility (15). This position would say 
that for the lists of high meaningfulness, 
high intralist similarity did produce a 
decrement in retention due to inter- 
ference from within the list but a roughly 
equivalent amount occurred in the low- 
similarity list because of interference 
from previous learning. We have no 
basis in the present data for choosing 
between these two alternatives. 

The above considerations lead to 
certain definite principles. If a single 
list is learned, the list with the lowest 
meaningfulness and the highest intralist 
similarity (within limits) will show most 
rapid forgetting. The list with the 
lowest meaningfulness and lowest intra- 
list similarity will show the slowest rate 
of forgetting. Indeed, it seems possible 
that if a list of consonant syllables of 
very low meaningfulness was constructed 
so that no letter was duplicated in the 
list, little forgetting would occur over a 
long period of time.- The present list of 
low meaningfulness and low intralist 
similarity had duplicated vowels and it 
is quite possible that these produced 
interference which is entirely responsible 
for the small amount of forgetting which 
occurred. As meaningfulness increases, 
however, we increase the likelihood that 
extra-experimental learning will interfere 
with recall. We cannot be entirely 
confident that relative magnitude of 
forgetting suggested by these principles 














will hold with paired-associate learning, 
since previous analyses (14) suggest that 
paired-associate lists are more susceptible 
to interference from previous habits than 
are serial lists. Even with very low- 
meaningful lists of paired associates it is 
possible that forgetting would be ap- 
preciable over 24 hr. 

Hovland (4) found little difference in 
recall as a function of serial position 
after 24 hr., whereas the present recall 
results show the typical bowed curve as 
found in learning. It is our belief that 
this contradiction is related to differences 
in interlist interference between the two 
experiments. Hovland’s Ss had learned 
many serial lists which would interfere 
considerably with recall. Not only 
would there be interference among 
specific items but also among lists as 
wholes, since the same anticipatory cue 
for the first syllable was the same for all 
lists. The interference should be 
greatest during the attempted recall of 
the first several items as S would have 
difficulty in establishing “which list this 
was.” Thus, recall of the first items 
should be severely depressed. By the 
time the middle items appeared, how- 
ever, S should have established what 
list was involved so that these items could 
be as well recalled as the first items even 
though they are weaker. Yet by this 
reasoning the items in the last portion of 
the list should be better recalled than 
any; this is not the case, and at the 
present time we have no resolution of this 
difficulty. Nevertheless, the fact that 
in Hovland’s study the bowed curve was 
very apparent on the second relearning 
trial is quite in line with previous 
findings of the transitoriness of inter- 
ference among serial lists (14). 


SUMMARY 


The purpose of this experiment was to deter- 
mine the influence of meaningfulness, intralist 
similarity, and serial position on retention of 
serial lists when Ss had had no previous labora- 
tory experience with such lists. Nonsense- 
syllable lists of 10 items each were used, the four 
lists representing two extremes of meaningfulness 
and two extremes of intralist similarity. A total 
of 400 Ss, 100 for each list, learned to one perfect 
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trial'and recalled and relearned after 24 hr. 
results show: 


The 


1. In learning, the lower the intralist simi 
larity the faster the learning and the higher the 
meaningfulness the faster the learning. Intra- 
list similarity produced a greater difference with 
the low-meaningful lists than with the high. 

2. For low meaningfulness, recall was signifi- 
cantly better with low intralist similarity than 
with high intralist similarity. No difference was 
observed for the high-meaningful lists. 

3. Meaningfulness did not significantly in- 
fluence recall. 

4. Recall was not related to serial position 
when response strengths at end of learning were 
equalized for all serial positions. 


The amount of forgetting observed was con- 
siderably less than found with comparable 
materials in experiments in which S served in 
several conditions. The difference seems almost 
without doubt to be due to large amounts of 
interlist interference which occurs when S serves 
in several conditions. The results provide 
strong support to a general theory of inter 
ference as a prime cause of forgetting and to 
Cibson’s theory in particular. 
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TEMPO OF FREQUENCY CHANGE AS A CUE FOR 
DISTINGUISHING CLASSES OF SPEECH SOUNDS! 


ALVIN M. LIBERMAN,?* PIERRE C. DELATTRE,* LOUIS J. GERSTMAN, AND 
FRANKLIN 5S. COOPER 


Haskins Laboratories, New York City 


In recent experiments with syn- 
thetic speech (3, 5, 6, 7) we have 
isolated many of the acoustic cues 
by which a listener identifies various 
consonants of American ~ English. 
Some of the cues that we have found 
to be important for the voiced stops 
(b, d, g), the voiceless stops (p, t, &), 
and the nasal consonants (m, n, 9) 
are illustrated in the hand-painted 
spectrograms of Fig. 1. These spec- 
trographic patterns are very highly 
simplified and schematized by com- 
parison with spectrograms of real 
speech, yet capable of producing fair 
approximations to the intended con- 
sonant-vowel syllables when con- 
verted into sound by an appropriate 
playback instrument. The vowel, 
which is a in all cases, is given by the 
two concentrations of acoustic energy, 
or “formants” as they are called, the 
one centering at about 720 cps and 
the other at 1320 cps. The consonant 
cues, with which we will be more 
concerned, begin at the left-hand side 
of each pattern and extend to the 
point at which the formants assume 
the steady state that characterizes the 
vowel. 

As can be seen in the hand-p.inted 
spectrograms, the acoustic cues that 
distinguish the columns of the figure 
(i.e., b-p-m from d-t-n from g-k-9) 
are the direction and extent of the 
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relatively rapid shifts in the frequency 
position of the second (higher) for- 
mant. These frequency shifts, or 
“transitions,” are typically found in 
spectrograms of real speech at the 
junction of consonant and vowel.‘ 
By converting patterns like those of 
Fig. 1 into sound, we have found that 
the second-formant transitions can, in 
fact, be cues for the perceived dis- 
tinctions among the three classes, 
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Fic. 1. Hand-painted spectrograms of nine 
consonant-vowel syllables, showing some of the 
principal acoustic cues for the perception of the 
stop and nasal consonants. 


b-—p-m, d-t-n, and g-k-y. ‘There are 
other acoustic cues for these same 
classes—we have discovered that ap- 
propriate transitions of the third 
formant, for example, contribute to 
the identification of these sounds 
but our results indicate that the 
second-formant transitions are very 
nearly sufficient. 


*The transitions presumably reflect the 
changes in the oral cavities that necessarily occur 
as the articulators move from the position of the 
consonant to that of the vowel, 
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We know rather less about the cues 
that distinguish the patterns that 
form the rows (i.e., b-d-g, p-t-k, 
m-n-y). Though we have isolated 
several cues for the distinction be- 
tween the voiced (b-d~g) and voiceless 
(p-t-k) stops, we have not yet investi- 
gated the problem carefully enough to 
know which of these are most im- 
portant. That omission need not 
concern us here, however, since for 
this paper the most relevant con- 
sideration is the evidence that each 
of these cues is essentially constant 
within a row (of Fig. 1) and more or 
less capable of distinguishing that row 
from others. As shown in the figure, 
one cue that marks the voiced stops 
(b-d-g) as a class is the low-frequency 
voice bar that immediately precedes 
the transitions. The voiceless stops 
probably depend in part on the pres- 
ence of aspiration (i.e., noise) in place 
of harmonics in the first part of the 
transition. For the distinction be- 
tween the nasal consonants (m-n-—n) 
aad stops (b-d-g, p-t-k) the short, 
steady-state resonance seen at the 
beginning of each of the nasal con- 
sonants in the bottom row is quite 
adequate. 

The patterns of Fig. 1 provide the 
basis for a table of acoustic elements 
out of which many of the sounds of 
speech can be made.’ The table is 
interesting, we think, because it 
indicates that a small number of cues 


*The 3X 3 table of Fig. 1, in which the 
patterns are arranged in terms of the acoustic 
cues, parallels a commonly accepted phonetic 
classification according to the articulatory di- 
mensions of place and manner of articulation. 
The three columns correspond to three places of 
production (i.¢., points along the oral tract at 
which the consonant closure is made), p-b—m 
being produced at the lips, t-d—n at the alveolar 
ridge, and k-g-y at the velum. Manner of 
production refers to the particular articulatory 
features (for example, the presence or absence 
of voicing or nasality) that are common to the 
sounds in any given row. 
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on one acoustic dimension (second- 
formant transition) combine in all 
possible pairs with various acoustic 
markers (the constant resonance of 
the nasal consonants, for example) to 
produce some of the highly distinctive 
sounds of speech. Thus, highly 
identifiable stimuli are created out 
of the wholly unpatterned combi- 
nation of simple and discrete elements. 
We expect that the table of Fig. 1 
will ultimately be expanded to include 
all the consonant sounds. It is one 
of the purposes of this study to take 
a step in that direction.® 

Exploratory work suggested that 
we might convert the patterns of Fig. 
1 into different speech sounds by 
varying only the tempo of the transi- 
tions, thus pointing to time as another 
of the stimulus dimensions which may 
be important in the perception of the 
individual phones of speech. As the 
transitions are progressively slowed, 
the pattern for the stop consonant b 
plus the vowel a (as in bottle) begins 
to be heard as the semivowel w plus a 
(as in wobble), and then as a vowel 


*Students of language have long found it 
useful to describe each consonant as an articu- 
latory event in which one of a small number of 
places of production is combined with one 
manner. The specification of place and manner 
is, then, sufficient to describe a consonant 
uniquely, To the extent that the relation 
between articulation and sound is not too com- 
plex, we should expect that the categories of 
place and manner might be equally appropriate 
for a classification of the acoustic characteristics. 
That such a classification has utility for a study 
of speech perception has recently been shown in 
an important experiment by Miller and Nicely 
(8). They took account of the confusions that 
occur when English consonants are heard against 
progressively increasing amounts of noise and 
found, for example, that a listener will continue 
to identify a sound correctly as being a member 
of a certain manner class even though he does 
not hear it accurately in regard to its place of 
production. In general, their results indicated 
that the place and manner cues, whatever their 
nature, may be perceived quite independently 
of each other. 
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which changes color from u to a (as 
in too odd). Similarly ga (as in 
goggle) goes to ja (as in yacht), and 
then to ia (as in theology). The 
remaining stop, d, goes through com- 
parable transformations, producing 
(with the vowel a) the semivowel y 
(as in French nuage) and then the 
vowel-of-changing-color ya (as in 
French cru a).’ 

The experiments to be reported 
here were designed to extend our 
exploratory observations concerning 
transition tempo as a cue for the per- 
ceived distinctions among stop con- 
sonants, semivowels, and vowels of 
changing color. 


EXPERIMENT I 


In this part of the study we have 
tried to determine whether changing 
the tempo of first- and second- 
formant transitions is sufficient to 
convert the syllables be and ge (stop 


consonant plus vowel) into we and je 
(semivowel plus vowel) and, in the 
extreme, into we and ie (vowels of 


changing color). The semivowel 
(plus vowel) ye and vowel-of-changing- 
color ye, which presumably result 
from a slowing of the d transition, are 
not familiar to our American listeners 
and were for that reason excluded 
from the experiment. 

The vowels of changing color are 
very different linguistically (and per- 
ceptually) from stops and semivowels 
in that they are not single linguistic 
units and occur only in situations in 
which one of the vowels belongs to 
one syllable and the other to the next. 
We thought it appropriate, therefore, 


7In articulatory terms 6 and w have a com 
mon place of production at the lips; d and y are 
both produced at the alveols. The semivowel 
j is normally articulated at a point slightly 
forward of the velar g; however, there is in 
English no semivowel whose place of articulation 
is closer to g. 
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to deal separately with the two- 
category distinction between stop 
and semivowel and the three-category 
choice among stop, semivowel, and 
vowel of changing color. 


Method 


General aspects of procedure and apparatus. 
In these experiments hand-painted spectrograms 
have been used as a basis for creating and con 
trolling speech-like sounds. To convert the 
spectrograms into sound—an obviously neces 
sary step in this method-—we take advantage of 
a special-purpose instrument called a pattern 
playback. Descriptions of this instrument, 
together with discussions of our method, are to 
be found in earlier papers (1, 2, 4). 

The playback employs a variable-density tone 
wheel to modulate the light from a mercury arc, 
producing a fundamental of 120 eps and all its 
harmonics through the fiftieth at C000 cps. The 
modulated light beams are imaged on the 
spectrogram, and are so spread across it as to 
match its frequency scale. As the hand-painted 
spectrogram is moved through the light, the 
white paint reflects beams whose modulation 
frequencies correspond to the position of the 
paint on the frequency scale of the spectrogram. 
The reflected beams are led by plastic light 
guides to a phototube, the current of which is 
amplified and converted to sound. 

Stimuli.—To vary the tempo of the transi 
tions, we prepared (for conversion into sound) 
spectrographic patterns like those shown in Fig. 
2. The series of patterns illustrated in the top 
and bottom rows were designed to produce be, 
we, ue, and ge, je, ie, respectively. In each 
series, the duration of first- and second-formant 
transitions was varied from 10 msec. to 300 
msec. in steps of 10 msec. When we vary the 
duration of the transition, we are also, of course, 
varying the rate of transition (that is to say, the 
frequency shift per unit time) so long as the 
frequency extent of the transition 
constant. For convenience, we will present 
specific values in terms of duration; however, 
we will continue to speak of this variable 
generally as tempo. 

As can be seen in Fig. 2 the first- and second 
formant transitions were varied together, the 
durations of the two being always equal in any 
one sound. This approximates what would 
seem, on the basis of an inspection of spectro- 
grams, to happen in real speech 

The steady-state portion of the pattern has a 
duration of 300 msec. in all cases, and its 
formants are so placed as to produce an approxi 
mation to the vowel ¢e. By using this vowel, it 


remains 





LIBERMAN, DELATTRE, GERSTMAN, AND COOPER 





be-we-uwe SERIES 
4 


Peeps 


— eS 


| 














QUENCY IN CPS 


gt-je-ie SEnies 


FRE 





be a 


po 


| 














TIME IN MSEC. 


Fie. 2. 


Illustrations of the spectrographic patterns used to produce the stimuli of Exp. I. The 


first four patterns in each row show how the tempo of the transitions was varied. At the extreme 
right of each row is a complete stimulus pattern, i.e., transition plus steady-state vowel, for the 


longest duration of transition tested. 


has been possible to make the frequency shifts 
for the be-we-ue series exactly equal in extent 
to those that produce ge-je-ie. The b transition 
begins 720 cps below, and the ¢ transition 720 
cps above, the steady state of the second formant 
of the vowel (1800 cps). The first-formant 
transition begins at 120 cps and rises to the 
steady state of the first formant at 480 cps. 
The transitions of actual speech are not, of 
course, so angular as those shown in Fig. 2. We 
have found, however, that drawing the transi- 
tions as we have in that figure does not adversely 
affect the synthetic speech, and it does, for 
obvious reasons, enable us to control transition 
durations more precisely. Each formant con- 
sisted of a central strong harmonic and two 
flanking harmonics of lower intensity. 
Presentation of stimuli.—As was pointed out 
in the introduction to Exp. I, we had thought it 
desirable to experiment separately with the two- 
category distinction between stop and semivowel 
and the three-category choice among stop, semi- 
vowel, and vowel of changing color. To in- 
vestigate the distinction between stop and semi- 
vowel we assembled two sets of test stimuli. 
One, which we will call Test B-W, included 12 
patterns from the series illustrated in the top 
row of Fig. 2 (rising second-formant transitions). 
The transition durations of these 12 patterns 
ranged from 10 to 120 msec. in steps of 10 msec., 
and had seemed on the basis of exploratory work 
to cover quite adequately the range from be 
through we. The other set of stimuli, which we 


will call Test G-J, included the 12 corresponding 
patterns from the bottom row of Fig. 2 (falling 
second-formant transitions). These had ap- 
peared in exploratory work to produce sounds 
that ranged from ge through je. 

For the three-category choice among be, we, 
and ue, we used all the patterns of the series 
illustrated in the top row to make up a set of 
stimuli that we will refer to as Test B-W-U. 
The series in the bottom row was used in con- 
nection with the distinctions among ge, je, and 
ie, and will be referred to as Test G-J-l. For 
each of these two tests there was a total of 30 
stimuli with transition durations that ranged 
from 10 to 300 msec. in steps of 10 msec. 

The spectrographic patterns in each of the 
four tests were converted into sound and re- 
corded on magnetic tape. By cutting and 
splicing the magnetic tape we assembled two 
random orders of stimulus presentation for each 
test. The signals were arranged on the magnetic 
tape in such a way that each stimulus would be 
presented and then repeated after an interval 
of lsec. There was an interval of 6 sec. between 
successive pairs of stimuli (i.e., a sound and its 
repetition). This interval provided time for S 
to make and record his judgment. The Ss 
heard and identified each stimulus only once in 
Tests B-W and G-J. Half the Ss heard the 
stimuli in one random order, and half in the 
other. The Ss who identified the stimuli of 
Tests B-W-U and G-J-I made two judgments 
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of each stimulus, each of these tests having been 
given to the Ss once in each random order, 

The Ss were told that each stimulus would be 
a synthetically produced syllable consisting of 
an initial speech sound followed in all cases by 
the vowel e. They were asked to identify only 
the initial sound, and to limit their responses to 
the choices offered by E. For tests B-W, G-J, 
B-W-U, and G-J-I these choices were b or w, 
g or j, b, w, or u, and g, 7, or i, respectively. 
Examples of these sounds, in syllabic contexts 
with the vowel ¢, were given. The Ss were 
urged to make an identification of every stimulus, 
even though their judgments might in some cases 
be guesses. Before Ss made judgments in any 
test they were asked to listen to the first four 
stimuli of the series in order that they might 
become familiar with the nature of the sounds 
and the general method of presentation. 

The four tests were variously interspersed 
among other tests involving synthetic speech 
(for example, groups of stimuli set up to study 
the fricative and nasal consonants), and some- 
times two tests of the present experiment were 
given to the same group of Ss at the same 
session. The only restriction was that Test 
B-W was never paired with Test B-W-U, and, 
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similarly, that Test G-J was never paired with 
Test G-J-I in the same session. An analysis 
of the results showed essentially no effect of the 
context in which a particular test was given (or 
of the random order used), so in reporting the 
results we have combined all the judgments for 
each test. 

Subjects.—A total of 168 paid volunteers, all 
of them undergraduate students at the Uni- 
versity of Connecticut, served as Ss. Of this 
group, 59 took Test B-W and @ took Test G-J. 
The stimuli of Tests B-W-U and G-J-I were 
presented to groups of 41 and 49 listeners, 
respectively. Of the 41 listeners for Test 
B-W-U, 19 had previously heard and identified 
the stimuli of Test G-J ; 22 of the 49 listeners for 
Test G-J-I had previously served in Test B-W. 
Prior to serving in this experiment, none of the 
Ss had had experience in identifying the syn- 
thetic speech sounds produced by the pattern 
playback. 


Results 


Figure 3 shows that our listeners 
were able to use the tempo of first- 
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transition. 


The distinction between stop consonant and semivowel as a function of the tempo of 
These curves show the percentage of stop consonant responses, and are based on the 


judgments of separate groups of 59 and 60 listeners for the b-tw and g-j distinctions, respectively. 


Tempo is here expressed in terms of duration. 
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and second-formant transitions as a 
cue for distinguishing between stop 
consonant and semivowel. The 
change from b to w occurred when the 
duration of the transitions reached 40 
msec., while g changed to 7 in the 
neighborhood of 50-60 msec. It is 
apparent from both curves that the 
very shortest duration of transition 
did not produce the best stop 
consonant. 

Figure 4 shows the judgments we 
obtained when the range of transition 
tempos was increased and a third 
judgment category (ue or ic) was 
added. Apparently, our Ss can re- 
liably discriminate the three cate- 
gories, stop, semivowel, and vowel of 
changing color on the basis of tempo 
of transition alone. It is also ap- 
parent, however, that the amount of 


Ino 
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agreement among our subjects was not 
so great in the three-category as in the 
two-category situations (cf. Fig. 3). 
The expansion of the stimulus range 
and the inclusion of the third judg- 
ment category obviously had the least 
effect on the responses to the stimuli 
at the short-duration end of the scale. 


Experiment II 


All the results of Exp. I, which 
indicated that transition tempo can 
distinguish stop consonant from semi- 
vowel from vowel of changing color, 
were obtained with the vowel e fol- 
lowing the initial transition. In Exp. 
II we have tried to find out how 
listeners respond to variations in 
transition tempo when vowels other 
than é€ constitute the second part of 
the syllable. This experiment was 
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The distinctions among stop consonant, semivowel, and vowel of changing color as 
The curves representing the be-we-ue and ge-je-ie responses 


are based on the judgments of separate groups of 41 and 49 listeners, respectively. ‘Two judgments 


of all stimuli were obtained from each listener. 


As in Fig. 3, tempo is expressed in terms of duration. 
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stimulus pattern is shown at the extreme right. 


concerned only with the distinction 
between the stop consonant b and the 
semivowel w. We have omitted the 
g-j distinction because to produce g 
with a variety of vowels requires a 
radical shift in the frequency level at 
which the second-formant transition 
begins, and introduces complications 
which are largely irrelevant to our 
present purposes. 

In Exp. I rate and duration of 
transition were varied together. This 
was unavoidable, of course, because 
the extent of the transition—that is 
to say, the frequency range through 
which the formant moved—remained 
constant. To produce the various 
vowels that have been used in Exp. 
Il we have had, necessarily, to put 
the steady-state formants at several 
different levels. By 
starting the transitions from the same 
point for all these vowels, as can be 
done with b, we have been able to 
vary the extent of the transitions 
(from vowel to vowel) and thus to 
separate the rate and duration aspects 
of transition tempo. The results of 
this experiment should, therefore, 
help to determine which of these 
variables—rate or duration—is the 
controlling cue. 


frequency 


Method 


Stimuli.—In Fig. 5 are samples of the spec- 
trographic patterns that were used to produce 


Illustrations of the spectrographic patterns used to produce the stimuli of Exp, Il 
only of the 15 transition durations, viz., 50 msec., is shown with each of the vowels 


One 
\« omplete 


the stimuli of this experiment. These patterns 
are the same as those of the B-W test of Exp. I 
except in three respects. The principal differ 
ence concerns the steady-state vowel. In Exp 
I the first and second formants of the steady 
state portion of the syllable were always set at 
frequency levels that would produce synthetic 
approximations to the vowel e. (See Fig. 2) 
In Exp. II there are six different steady-state 
levels of the formants, these levels being ap 
propriate to the vowels i, ¢, ¢, 4, 9, ando. The 
second difference is in the frequency at which 
the second-formant transitions begin— 1080 cps 
in Exp. I and 600 eps in Exp. IL. It was 
necessary in Exp. II to lower the starting point 
of the second-formant transition in order to 
produce 6 and w with vowels whose second 
formant frequencies are as low as those of 9 and 
o. For syllables that contain the other vowels, 
i, ¢, €, and a, it is not necessary to start the 
second formant at so low a frequency, but, as 
we pointed out above, it is possible to do so." 
We thought in this case that it would be de 
sirable to start all the second-formant transi 
* To produce the very best b's we must start 


the second-formant transition at levels higher 
than 600 cps, especially when the } is followed 
When we 
start this transition from a point as low as (OO 
cps, we not only produce a somewhat inferior b, 
but we tend in some cases to add at least a 
suggestion of bw. This is to Say that starting 


the second-formant transition at or 


by a front vowel such as i, ¢, or ¢ 


near the 
b w locus is itself a cue for the semivowel w 
For a discussion of “locus” see (4) 

The addition small amount of w is 
counterbalanced, perhaps, by the fact that 
starting the first formant at 120 cps, as we do in 
all the patterns of this experiment, provides a 
cue for the stop consonant b as opposed to the 
semivowel w. 


of a 


To synthesize the best semivowel 
we must start the first formant at a somewhat 
higher frequency. 
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The distinction between b and w with various vowels. 


These curves show the percentage 


of b judgments as a function of transition tempo when tempo is expressed as duration of transition. 


Each of 38 listeners judged all stimuli twice. 


Duration is scaled logarithmically to make these 


curves more directly comparable with those of Fig. 7 and 8. 


tions at the same low frequency because in so 
doing we produced the greatest variation in 
extent of transition from vowel to vowel and 
thus obtained the greatest separation of rate 
and duration. The third difference between the 
patterns of Exp. I and those of Exp. II concerns 
the particular transition durations we chore to 
test. The results of Exp. I (see. Fig. 3) had 
suggested that it would be most appropriate to 
sample the range 10 to 160 msec., using smaller 
steps at the short end and larger steps at the long 
end. Accordingly, we selected for use in Exp. 
II the following 15 values of transition duration 
for each vowel: 10, 15, 20, 25, 30, 40, 50, 60, 70, 
80, 90, 100, 120, 140, and 160 msec. 
Presentation of stimuli.—The 9 stimuli of 
Exp. II (15 transition durations times six 
vowels) were converted into sound on the 
pattern playback and recorded on magnetic 
tape. By cutting and splicing the magnetic 
tape we prepared two random orders of these 
sounds, The presentation of the stimuli was 
exactly as it had been in the B—W test of Exp. I, 
except that each S heard and judged the stimuli 
once in each random order. Thus, two judg- 
ments of each stimulus were obtained from each 


Subjects.—There were 38 Ss, all of whom were 
paid volunteer undergraduates at the University 
of Connecticut. None of the Ss had previously 
served in Exp. I or had any other opportunity to 
hear the synthetic speech sounds produced by 
the pattern playback. 


Results 


We see in Fig. 6 that transition 
tempo is sufficient to distinguish the 
stop consonant b from the semivowel 
w with each of a wide variety of 
vowels. A comparison of the curves 
of Fig. 6 with those obtained for the 
vowel ¢ alone (see Fig. 3 in Exp. 1) 
shows no very great difference. 

When our listeners’ judgments are 
plotted against transition duration, 
as in Fig. 6, the curves for the various 
vowels are very nearly superimposed. 
We may reasonably wonder what 
these curves will look like when the 
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judgments are plotted against rate of 
transition, since, as we pointed out 
earlier, rate and duration can be 
separated in this experiment. Such 
plots are shown, separately for second 
and first formants, in Fig. 7 and 8. 


It will be remembered that our 
stimulus patterns were so drawn as to 
make the duration of first- and second- 
formant transitions always equal in any 
one pattern. Transition rates, on the 
other hand, are different for first and 
second formants inasmuch as the transi- 
tion extents differ. In considering the 
responses as a function of rate, we must, 
therefore, deal with first and second 
formants separately. 

We should note in regard to Fig. 8 
that duration and rate of first-formant 
transition covary for the vowels ¢e-o and, 
also, for €—9, since, as can be seen in the 
stimulus patterns of Fig. 5, the extent of 
first-formant transition is the same 


100 
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within each of these two pairs. It 
follows, then, that if the data for ¢ and o, 
or for € and 9, should yield similar curves 
when plotted against duration, they 
would necessarily produce similar curves 
against rate of first-formant transition. 


Clearly the curves for the various 
vowels are not so nearly coincident 
for either of the rate plots as they are 
when the abscissa is laid out in terms 
of duration. This result would ap- 
pear to support an assumption that 
duration of transition is the essential 
component of what we have been 
calling “‘tempo’’—that is to say, that 
duration rather than rate is the 
controlling cue. 


In assessing the relative importance of 
duration and rate we should take account 
of the fact that the rates of first- and 
second-formant transitions move in op- 
posite directions in the vowel series from 
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of rates for each of the vowels. 
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The same data as shown in Fig. 6 and 7, here plotted against 


rate of first-formant transition. 


ithrough a. (The extent of the second- 
formant transition decreases from 1800 
cps for i to 720 cps for a, while, for the 
same change in vowel, the extent of the 
first-formant transition increases from 
120 cps to 600 cps. Hence, for a given 
duration of transition, the rate of the 
second-formant transition decreases and 
the rate of the first-formant transition 
increases as the vowel is changed from i 
through ¢ and ¢€ to a.) It is possible, if 
unlikely, that the rate cues produce 
effects which cancel each other in such 
a way as to generate curves that appear 
to be invariant with duration. 

For the vowels a, 9, and o, the extents 
(and hence the rates) of first- and second- 
formant transitions vary in the same 
direction, so we ought with these vowels 
to be able to make a less ambiguous 
comparison of the roles of duration and 
rate. If we look at the results for these 
three vowels, we see that the curves 
appear to be more closely bunched when 
plotted against duration than when 
plotted against rate, though in these 


cases the relatively small change in 
frequency extent of transition provides 
something less than an ideal basis for 
comparison. 

Obviously, the clearest separation of 
rate and duration could be made if we 
were to hold the transition of one 
formant constant in all respects while 
varying the rate, duration, and extent 
of the other. A series of exploratory 
studies made it clear that for distin- 
guishing stop and semivowel the tempo 
of the second-formant transition is con- 
siderably more important than that of 
the first. Nevertheless, we failed to 
produce a highly realistic series from 
stop to semivowel with one transition 
fixed, and we have therefore been unable 
by this means to obtain direct evidence 
in regard to rate vs. duration. 


We have found in exploratory work 


that the conversion of the stop 
consonant b to the semivowel w is 
most effectively accomplished if, in 
addition to slowing the transitions, 
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we also make other changes in the 
spectrographic pattern. The ac- 
curate specification of these other cues 
and an evaluation of their relative 
contributions have yet to be made. 
At the present time we can only say 
that we have so far found no cue that 
promises to be more important than 
the time cue we have isolated in the 
present experiment. In any event, 
it is clear from the data reported here 
that the tempo or, more specifically, 
the duration of transition is a suf- 
ficient cue for distinguishing stop from 
semivowel from vowel of changing 
color. Thus, duration is to be added 
to the list of dimensions that are 
important for the discrimination and 
identification of individual 
sounds. 


speech 


SUMMARY 


It had been shown in earlier research that the 
direction and extent of second-formant transi- 
enable listeners to distinguish speech 
sounds within each of the three classes, voiceless 
stops (p-t-k), voiced stops (b-d-g), and nasal 
consonants (m-n-y). In distinguishing among 
these classes, listeners apparently depend on 
certain other cues, such as the presence of a nasal 
resonance or voicing, each of which serves as an 
acoustic marker for its class. In the first part 
of the present experiment it was found that, with 
all other things equal, the tempo of the transi- 
tions was sufficient to distinguish members of 
the class of voiced stop consonants from cor- 
responding members of the classes semivowels 
and vowels of changing color. In the syllable 
consisting of the stop consonant b plus the vowel 
e, the stop 6 waa transformed into the semivowel 
w when the duration of first- and second- 
formant transitions exceeded 40 msec.; the 
corresponding change from ge to je occurred at 
50 or msec. Further increases in the duration 
of the transitions caused we and je to become the 
vowels-of-changing-color ue and ie, respectively. 
The shift from semivowel to vowel of changing 
color was much less sharp than the change from 
stop to semivowel. 


tions 
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In the second part of the study it was found 
that transition tempo served equally well for 
distinguishing b from w with many vowels other 
than the one (¢) used in obtaining the results 
just described. By investigating the tempo cue 
with a variety of vowels, it was possible (as it 
had not been in the first part of the study) to 
assess the relative contributions of duration and 
rate of transition. The results suggest that 
duration is the significant aspect of the tempo 
cue. 
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PREDICTION OF DECISIONS FROM A HIGHER 
ORDERED METRIC SCALE OF UTILITY? 


PAUL M. HURST* AND SIDNEY SIEGEL 


The Pennsyloania State University 


The purpose of this experiment was 
to compare the efficiency of two 
models for predicting a person’s 
choices between alternative bets. 
One model bases all predictions on 
the person’s previously determined 
higher ordered metric scale of utility 
for the commodity for which he 
gambles, asserting that he will make 
his decisions so as to maximize ex- 
pected utility. The other bases all 
predictions on the objective or “‘face”’ 
value of the commodity, asserting that 
the person will make his decisions so 
as to maximize expected value. A 
secondary purpose of the study was 
to identify one source of those errors 
which do occur in predictions based 
on the higher ordered metric scale of 
utility. 

The concept of utility (subjective 
value, or value to the person) is 
central in decision theory, a field 
which has been reviewed by Edwards 
(5). Recent research on the interval 
measurement of utility has been 
reported by Davidson, Siegel, and 
Suppes (4), and by Edwards (6). 
Coombs and Beardsley (2), Coombs 
and Komorita (3), Siegel (8), and 
Siegel and Beltz (9) have reported 
research concerned with ordered 
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metric and higher ordered metric 
measurement of utility. 

An ordered metric scale (1, 8) 
ranks the distances between an indi- 
vidual’s preferences among certain 
commodities. It may be used with 
both discrete and continuous com- 
modities. The method of higher 
ordered metric scaling devised by 
Siegel (8), which is based upon the 
work of Ramsey (7) and Von Neu- 
mann and Morgenstern (10), and 
which draws upon the developments 
of Davidson, Siegel, and Suppes (4), 
yields a preference scale containing 
the following information: “(a) it 
orders the entities involved, i.e., 
A>B>C>D>E... ; (db) it 
orders the distances between the 
entities, i.c., say DE > AB > CD 

. , where AB implies the difference 
in utility between A and B; and (c) it 
orders all possible combinations of 
contiguous distances between entities, 
i.e. say AB + BC > BC + CD 
+ DE, or AB > CD + DE” (8). 

In the present study, inmates in a 
federal prison were used as Ss. 
Cigarettes, which are negotiable in 
prison society, were used as the 
commodity in the gambling situation, 
and each S’s utility of cigarettes was 
measured in a higher ordered metric 
scale. The information yielded by 
the study was used to test two 
hypotheses. 

The first hypothesis concerned the 
relative predictive efficiency of the 
two models mentioned above: 


l. Ss will make decisions so as to maximize 
expected utility rather than expected objectie 
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value, and thus the predictions of decisions 
based on Ss’ higher ordered metric scales of 
utility will be more accurate than those based 
on objective value 


The second hypothesis, stated be- 
low, was directed at identifying one 
source of error in prediction from a 
higher ordered metric scale. The 
differences in utility on the scale are 
used to predict S’s choices between 
alternative gambles. It is possible, 
however, that S might find a given 
set of two alternative gambles equally 
attractive or equally unattractive 
because they are based on practically 
equal distances, and would therefore 
be indifferent in his choice between 
them. In such a choice, it is reasoned 
that S might vacillate considerably 
before stating a decision. That is, 
the latency time between the offer 
of the choice and the statement of a 
decision would be long, reflecting S’s 
lack of a clear-cut preference. To 
test whether errors in prediction from 


a higher ordered metric scale are 
caused by the making of offers in 
which the two alternatives tend to be 
indiscriminable in expected utility, a 
second hypothesis was formulated: 


2. The latency times of those choices which are 
not predicted successfully by the principle of 
maximization of expected utility will be longer 
than the median latency time of all choices. 


Metuop 


Subjects —The Ss were 30 inmates of the 
Northeastern States Federal Penitentiary, se- 
lected from 73 volunteers who responded to an 
advertisement placed in the prison newspaper. 
The advertisement requested volunteers for an 
experiment, and did not reveal the purpose of 
the experiment or that it involved gambling. 
It offered no inducement for participation by the 
volunteers except the opportunity to acquire a 
number of cigarettes. In the prison, cigarettes 
are a negotiable commodity whose status ap- 
proaches that of a medium of exchange. 

From the list of applicants, a list of 30 Ss was 
compiled by selection on the basis of intelligence 
test scores (U. S. Public Health Service Intel- 
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ligence Test, combined scores). It was deemed 
desirable to use as Ss only those volunteers who 
were sufficiently intelligent to be able to under- 
stand the task required of them. The Ss who 
served in the experiment had IQ scores which 
ranged from 104 to 131, with a median of 113. 

A pparatus.—Sessions were held in a private 
room in the prison. 

Two of the high-precision nonsense-syllable 
dice designed by Davidson, Siegel, and Suppes 
(4), each having two zero-association nonsense 
syllables symmetrically distributed over its six 
faces, were used to establish a gambling event 
having .50 subjective probability. For a 
detailed discussion of this device and its ration- 
ale, see (4) or (8). From a cup, a die was 
rolled by S against a backstop. 

Paper was furnished to S to record the various 
offers and his choices in each case. 

A stop watch was used to measure the latency 
of each decision. The timing was done by an 
inmate assistant who worked with £. 

Procedure.—-Two sessions, separated by a 
week, were held with each S. The same 30 
offers, in the same order, were given in each 
session. 

At the beginning of each session, S was 
handed seven packages of king-size cigarettes 
of a leading popular brand and was informed 
that these consituted his “starting stake.” He 
was told that he might finish a session with well 
over a carton (10 packs) if he won, or that in 
the gambling he might lose part or all of his 
stake. He was told that whether he won or 
lost would depend on his skill in choosing the 
better alternative in each offer, and also on his 
luck in rolling the dice. The dice were then 
described to him. He was permitted to examine 
them and to make a few practice rolls, until he 
was convinced that they were fair. 

The following portion of the standard in- 
structions given each S shows how the data were 
then obtained: “... Now I'll give you a 
sample bet. This won’t count. Write down 
‘win 30’ and then below that write ‘lose 10.’ 
Use a plus to stand for ‘win’ and a minus to 
stand for ‘lose.’ Now across from that bet 
write ‘win 5’ and below that ‘win 5.’” 

The £ then ascertained that S had written 
the offer in this form: 


+ W 
~- 10 


+5 
+5 


“Now let’s use this die, and I say ‘QU is the 
winner.’ If you choose the first bet (win 30 or 
lose 10) and if QUJ comes up on your roll, you 
win 30 cigarettes. If you had chosen the second 
bet, you'd have won 5 cigarettes. On the other 
hand, if you choose the first bet and then QUG, 
the loser, comes up, you lose 10 cigarettes. But 
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TABLE 1 

Orrers 

Scale I 
1.432 +14| 6.414 -—3\11.4144 —3 
—34 +14 —34 —3 ~ -3 
2.432 +14) 7.432 -—3/12. -—3 —19 
—19 +14 ~34 —19 —34 -19 
3.432 —19| 8.414 —19/13.414 —3 
—~M% —19 —34 -—19 —-34 —19 
4.432 —3)| 9.432 —3/14.4+32 +14 
-19 —3 —34 -—3 —34 -—19 
5.432 +14/ 10, +32 +14/15. 432 +14 
—34 «3 —3 +14 —-19 —3 

Scale II 
1. +32 +17 | 6.417 +1/11.4+17 +1 
—34 +17 —34 +1 —16 +1 
2.432 +17 | 7.432 +1/12. +1 —16 
—16 +17 ~34 —16 —34 —16 
3.432 —16| 8.417 —16}13.4+17 +1 
—~34 —16 —34 —16 —34 —16 
4.432 +1) 9.432 +1/14. +32 +17 
—16 +1 —34 +1 —34 —16 
5.432 +17 | 10. +32 417/15. +32 +17 
—34 +1 +1 +17 —16 +1 


if you’d chosen the second bet in that case, you'd 
have won 5 cigarettes anyway. Now which bet 
do you want?” 

The S then chose a bet, and recorded his 
own choice by encircling it in pencil. 

“Let's roll it off and see what would have 
happened,” 

The S then rolled the die. The £ announced 
the outcome—how many cigarettes were won or 
lost on the roll. 

The same procedure was then followed for 
three more sample bets, shown below, which 
illustrate the basic types of offers: 


+10 —25 +25 -S5 -2 —10 
-—35 —25 -30 — 10 -30 —10 


The remainder of the instructions consisted 
in showing S how to record and announce his 
choices, and in ascertaining that the entire 
procedure was understood by him. 

The £ then announced that he would make 
30 offers in sequence, without stopping to roll 
the die to determine the outcome of each, and 
that all bets would then be “rolled off.” ‘That is, 
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S was instructed to make all his choices between 
the alternative gambles before any die was rolled 
to determine any of the payoffs. It was clearly 
understood by S that he could not change his 
mind after having made a choice, and that after 
all 30 choices had been made the die would be 
rolled by him for each of them and the payoffs 
thus determined. The S was informed that the 
purpose of this procedure was to save time; 
actually the primary purpose was to guard 
against the effects of reinforcement, which could 
have changed S’s utility of cigarettes in the 
course of the measurement session. 

The E concluded the instructions by an- 
nouncing that there would be no more practice 
bets—that the forthcoming bets would “count.” 
The 30 offers were then made, in the standard 
sequence shown in Table 1. In each case, S’s 
choice between the two alternatives was re- 
corded by both S and FE. Response latencies 
were timed and recorded by the assistant. 

After all his choices had been made, S rolled 
the dice 30 times, and either received or paid 
out the required number of cigarettes after each 
roll. The two nonsense-syllable dice were 
alternated on successive rolls, and the winner 
was changed in roughly random manner. Thus 
the sequence of dice and winners followed the 
form shown in Table 2. The purpose of fre- 
quently switching from one die to the other was 
to guard against the common tendency to con- 
sider a particular outcome less likely on the 
forthcoming roll if it has just occurred on 
previous rolls. The purpose of switching 
winners was to reassure S that no attempt was 
being made to trick him with biased dice. To 
accomplish this, the winner was switched more 
than randomly often. Outcomes of the die 
rolls were recorded and totaled by both E and 
the assistant. 

During S’s second session, the same procedure 
was employed as during the first session, except 
that the initial instructions were omitted. 
Questions relating to the procedure were 
answered by £. 

Ordered metric 


scaling. —The method of 


TABLE 2 


Sampce Sequence or Dice ann WINNERS 





Offer Die Winner 
1 QU]-QUG QU] 
2 XEG-WUH XEG 
3 QU)-QUG QUG 
4 XEG-WUH WUH 
5 QUJ-QUG QU] 
6 xX 


XEG-WUH 
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TABLE 3 


1. +32 +417 bess 
AB < BE 
- 34 


2. +32 +17 e 
AD >BB AB > BD 


AE>DD AD> DE 


AD>CC AC>CD 


AE>BC AB>CE 


BE<CC BC <CE 


AE>CD AC> DE 
—-H 
. +17 
BE > DD 


BD > DE 


-3 —16 


scaling used here is a direct application of that 
developed by Siegel (8), and the reader is re- 
ferred to that source for a full exposition. 

The first 15 offers showa in Table 1 pertain 
to Scale I. The second 15 pertain to Scale II. 
In each session, thus, two higher ordered metric 
scales of utility of cigarettes were obtained from 
S. The second session constituted a replication 
of the first. 

In Scale 1], A= +32,B=414,C = 3, 
D = — 19,and E = — 34. Thus the objective 
scale of value (the scale representing “face 
value”) is AB > BC > CD > DE (read: the 
distance in objective value between A and B is 
greater than the distance between B and C, 
which is greater than the distance between C 
3 ares, & 

In Scale I], A = + 32,B = +17,C = +1, 
D = — 16,and E = — 34. Thus the objective 
scale of value is DE > CD > BC > AB. 

When the offers are standard for all Ss, as 
was the case in this study, from five to nine 
specific choices must be examined for the 
information necessary for constructing a higher 


ordered metric scale of utility. The remaining 


| 
| 
| 


Sampte Protocor: Scare Il 


% +2 +1 


-x +1 
10, +32 +17 
AC > BB 
+1 +17 
Il. +1 
BD>CC BC>CD 
+1 


CE <DD CD < DE 


BE>CD BC> DE 


AE>BD AB> DE 


AD>BC AB>CD 


decisions yield “surplus” information which may 
be predicted from the other decisions, i.e., from 
the scale. Thus a minimum of six checks (pre- 
dictions of decisions), and usually more, may be 
made for each scale. In the present study, a 
maximin method for deriving ordered metric 
and higher ordered metric scales was used. A 
maximin method is one which maximizes the 
minimum gain of information: the maximum 
amount of information is obtained from a 
minimum number of observations. ‘The method 
used here maximizes the amount of information 
and therefore the number of predictions gained 
from observing a minimum number of S’s 
choices. This standard method dictates the 
order in which the various choices should be 
examined in the derivation of the scale. This 
order depends on the direction of each choice 
In the maximin method, a chart is used which 
indicates the order of examination for al! possible 
scales." 


* This method will be reported in a forth 
coming paper by Siegel. In the meantime, 
copies of the table used for deriving scales by the 
maximin method are available from him. 
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For illustrative purposes, we have chosen one 
S’s scale for presentation. The protocol is 
shown in Table 3. For each of the 15 offers, Ps 
alternative chosen by this S is set in boldface. 
In the case of Offer 1, for example, S chose to 
win 17 cigarettes for sure, rather than to gamble 
on a 50-50 chance of either winning 32 or losing 
34. Since +17=B, +32=@A, and — 34 
= E, his choice is recorded as AE < BB. This 
is shown in the second column of Table 3. 

Ordered metric scaling of utility is based on 
the fundamental proposition that if AE < BB, 
then AB < BE. That is, if S chooses the BB 
bet rather than the AE bet, then for him the 
difference in utility between B and E is greater 
than the difference in utility between A and B. 
For the rationale and proof of this, see Siegel 
(8). The third column in Table 3 translates 
each choice into information about distances on 
S’s utility scale. 

All of the information in the third column of 
Table 3 is not necessary in order to construct a 
higher ordered metric scale. Only certain 
choices need be examined. The standardized 
method based on the maximin principle directs 
the search. According to that method, Offer 10 
is examined first. It is observed that S chose 
the first alternative, indicating that for him 
AB > BC, The direction of this choice dictates 
that one turn to Offer 11. The direction of that 
choice leads one to examine Offer 13, after which 
the maximin principle directs attention to Offer 
12. On the basis of S’s decisions in those four 
offers, one can construct his ordered metric scale 
of utility for the five amounts of of cigarettes. The 
scale is: AB > BC > DE > CD. This may be 
contrasted to the objective scale for the values 
involved, which is DE > CD > BC > AB. 
The difference between S’s utility scale and the 
objective value scale demonstrates that for this 
S utility is not linear in cigarettes. The two 
scales also necessarily lead to different 
predictions. 

The next step is to obtain the higher ordered 
metric scale. Following the maximin chart, one 
examines Offers 1, 2, and 6, in turn, From the 
seven decisions observed, S’s higher ordered 
metric scale of utility is constructed. It is: AE 
> AD > AC > BE > AB > BD > CE > BC 
> DE>CD. This scale is shown graphically 
in Fig. 1. 

Only 7 of S’s 15 choices have been observed. 
The other 8 choices—the “surplus” choices— 
may be used to determine the correctness of the 
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predictive model. In each case, there are two 
alternative models for prediction: (a) the ex- 
pected objective value model, and (b) the 
expected higher ordered metric utility model. 
Consider Offer 5. This offer concerns AE 
and BC. In terms of objective or “face” value, 
AE < BC. That is to say, the objective value 
model would predict the choice of the second 
alternative, because the expected value of the 
first alternative is —1 and for the second 
alternative itis + 9. From observation of other 
choices, however, it has already been determined 
that in S’s higher ordered metric scale of utility, 


AB > CE. From the expected utility model, 
therefore, it is predicted that the first bet will be 
chosen, because if AB > CE, then AE > BC. 
This second prediction is the correct one, as 
Table 3 shows. 

There were eight choices with regard to which 
predictions could be made. The reader will 
observe that for this S all of these choices were 
correctly predicted by the expected utility model. 
Only half of them were correctly predicted by 
the expected value model: these were Choices 3, 
4, 7, and 8, which were correctly predicted from 
both models. The S’s choices on Offers 5, 9, 14, 
and 15 were incorrectly predicted by the ob- 
jective value model but correctly predicted by 
the utility model. 


RESULTS 


Of the 30 Ss, 29 completed the 


experiment. The data from these 29 
were analyzed to test the two 
hypotheses. 


Hypothesis 1.—For each S, four 
higher ordered metric utility functions 
were derived, two for each session, 
according to the procedure discussed 
abov.e For each function there were 
15 relevant decisions, including those 
central ones from which the function 
was derived and those “surplus” ones 
about which predictions could be 
made on the basis of the function. 

The S’s choices which had not been 
used in deriving the function were 
checked for predictability under both 
models: a set of choices was checked 
for predictability from the higher 





A B 
Fic. 1. 


Cc D E 


_Example of Higher Ordered Metric Scale II: 


AE > AD > AC > BE > AB > BD > CE > BC > DE > CD 
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ordered metric utility function, and 
the number of errors recorded, and 
then the same set of choices was 
checked for predictability from the 
objective value function and the 
number of errors recorded. 

For a given S, the total number of 
errors of prediction from the expected 
utility model was determined, as was 
the total number of errors of predic- 
tion from the expected value model. 
The difference between these two 
totals was the difference score, d, used 
in this statistical analysis. The mag- 
nitude of this d represents the com- 
parative accuracy of the two models: 
the larger the d, the greater the 
relative predictive efficiency of one 
model over the other. 

For the purposes of the nonpara- 
metric analysis employed, the ds were 
then ranked. Six Ss had ds of 0; the 
remaining ds were ranked from | to 
23. Fifteen of the ds were in the 


predicted direction, and the sum of 


their ranks was 206. Eight ds were 
in the opposite direction, and their 
sum of ranks was 70. Under the 
null hypothesis, these two sums of 
ranks would be expected to be about 
equal. The Wilcoxon matched-pairs 
signed-ranks test (11) demonstrated 
that the observed sums of ranks 
differed significantly from what would 
be expected under the null hypothe- 
sis: the data revealed that the ex- 
pected utility model yielded signifi- 
cantly fewer errors of prediction than 
the expected value model (P < .025). 
This confirms Hypothesis 1. 
Hypothesis 2.—For a given session, 
the latencies of the choices which were 
erroneously predicted by the expected 
utility model were compared with the 
over-all median latency for the 30 
choices made by S. The number of 
erroneously predicted choices with 
less than median latency was sub- 
tracted from the number with greater 
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than median latency. The indices 
thus obtained for S’s two sessions 
were summed algebraically to yield 
a single index for each S. 

Four Ss had ds of 0; the remaining 
ds were ranked from 1 to 25. Nine- 
teen of the ds were in the predicted 
direction, and the sum of their ranks 
was 298. Six were in the opposite 
direction, and the sum of their ranks 
was 53. The Wilcoxon test demon- 
strated that the latencies of the 
erroneously predicted decisions were 
significantly greater than the median 
latency (P < .0O1). This confirms 
Hypothesis 2. 


Discussion 


The fact that decisions are more 
accurately predicted from Ss’ higher 
ordered metric utility functions than 
from the objective value function is 
direct evidence that for these Ss utility 
is not linear in cigarettes (the social 
equivalent of money in this case). This 
finding is contradictory to that of 
Edwards (6) and supports the findings 
of Siegel (8) and Davidson, Siegel, and 
Suppes (4). The latter found that their 
technique for measuring utility in an 
interval scale yielded predictions which 
were 100% accurate for 15 of their 19 Ss, 
all but one of whose utility functions 
deviated from linearity in money. 
Edwards’ attempt at interval measure- 
ment of utility, with 5 Ss, yielded pre- 
dictions which were correct only 74% of 
the time, and which were exceeded 
(nonsignificantly) in efficiency by pre- 
dictions based on objective value. 

Although the most central reason for 
these differences in findings undoubtedly 
lies in differences in techniques of 
measurement, another reason seems to 
be that Edwards’ selection procedure 
(inviting Ss to participate in a gambling 
experiment), under which it was neces- 
sary to issue 23 invitations in order to 
obtain 5 Ss, yielded a selected group of 
Ss who were unusually willing to take 
risks and who might well be unusually 
experienced in gambling. In such per- 
sons, utility of money may well be more 
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linear than it is in most, and therefore 
the possibility that prediction from 
expected utility will improve over pre- 
diction from expected value is mini- 
mized. Moreover, Edwards’ study ex- 
tended over several months (thus 
requiring the assumption that utility 
remains stable over extended periods of 
time), and the utility curves he derived 
and used for prediction were based on 
averages. Inasmuch as he did not 
control reinforcement, it is possible that 
his finding utility curves which were close 
to linearity in money resulted from an 
artifact in his measurement. In the 
study reported here, the utility function 
was derived in about 15 min., at a single 
session, and reinforcement was ade- 
quately controlled during the measure- 
ment period. Insofar as Edwards may 
have succeeded in measuring utility, his 
results suggest the importance of certain 
parameters which enter into determining 
the nature of a person’s utility of money. 
They cannot be taken as evidence which 
infirms the hypothesis of maximization 
of expected utility. 


SUMMARY 


This study compares the efficiency of two 
models for predicting decisions in an uncertain 
outcome situation: the model based on the 
higher ordered metric measurement of utility, 
and the objective value model. 

In the experiment, Federal prisoners served 
as Ss and cigarettes were the commodity. ‘Two 
hypotheses were tested: 

1, Subjects will make decisions so as to 
maximize expected utility rather than expected 
objective value, and thus the predictions of 
decisions based on Ss’ higher ordered metric 
scales of utility will be more accurate than those 
based on objective value. 

2. The latency times of those choices which 
are not predicted successfully by the principle 
of maximization of expected utility will be longer 
than the median latency time of all choices. 

By a nonparametric test, Hypothesis 1 was 
confirmed at the P < .025 level, and Hypothesis 
2 was confirmed at the P < .0O1 level. The 
first result not only favors the utility model but 
also demonstrates that utility is not linear in 
cigarettes, the negotiable commodity used, 
while the second suggests that errors of pre- 
diction from a higher ordered metric scale occur 
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when one or more of the decisions on which the 
scale is based were made between two alterna- 
tives which were indiscriminable in expected 
utility. These findings support those of 
Davidson, Siegel, and Suppes (4) and of Siegel 
(8), and oppose those of Edwards (6). Although 
this discrepancy is doubtless primarily attrib- 
utable to differences in measurement procedures 
and criteria, differences in procedures for se- 
lecting Ss may also contribute to it. 


REFERENCES 


1. Coomus, C. H. A theory of “psychological 
scaling. Univer. Mich. Engng Res. Inst. 
Bull., 1951. No. 34, vi, 94 p. 

2. Coomes, C. H., & Bearpstez, D. On 
decision-making under uncertainty. In 
R. M. Thrall, C. H. Coombs, and R. L. 
Davis (Eds.) Decision processes. New 
York: Wiley, 1954. Pp. 255-285. 

3. Coomas, C. H., & Komonrita, S. S. Meas- 
uring utility of money through decision- 
making. Unpublished manuscript, Uni- 
ver. of Michigan, 1955. 

4. Davinson, D., Siecer, S., & Supres, P. 
Some experiments and related theory in 
the measurement of utility and subjective 
probability. Stanford, Calif.: Stanford 
Value Theory Project, Rep. No. 4, 1955. 

5. Epwarps, W. The theory of decision 
making. Psychol. Bull., 1954, $1, 380- 
417. 

Epwarps, W. The prediction of decisions 
among bets. J. exp. Psychol., 1955, 50, 
201-214. 

7, Ramsey, F. P. Truth and probability. 
In The foundations of mathematics and 
other logical essays. London: Paul, 
French, ‘Trubner, 1931. 

8. Stecer, S. A method for obtaining an 
ordered metric scale. Psychometrika, 
1956, 21, 

9. Siecer, S., & Bertz, S. Empirical verifi- 
cation of a decision-theory model: Pre- 
diction of choices between uncertain 
outcomes in the discrete case. Paper 
read at Eastern Psychol. Ass., Atlantic 
City, March, 1956. 

10. Von Neumann, J., & Morcenstrern, O. 
Theory of games and economic behavior. 
(2nd Ed.) Princeton: Princeton Univer. 
Press, 1947. 

ll. Witcoxon, F. Some rapid approximate 
statistical procedures. New York: Ameri 
can Cyanamid Co., 1949. 


6. 


- 


(Received for early publication 
March 13, 1956) 














QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY 
Founded in 1948 


Published Quarterly in February, May, August, and November 
for the 
EXPERIMENTAL PsycHo_ocy Group 


Editor: D. Russell Davis, University of Cambridge 
Subscriptions from the United States 


should be entered now for the next year. 
30 shillings per year (current rate $4.20). 





Order from 


W. Heffer & Sons, Ltd. 
Cambridge, England 




















TH 


iy ie he: 


nae 


f Hee 


nual 





